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METHOD AND CELLS TO SCREEN COMBINATORIAL^ I BRAR I ES 



Combinatorial chemistry (reviewed in Janda, 1994), in 
which a diverse library of laboratory-constructed molecules is 
screened for structures that exhibit properties of interest, 
is a relatively novel and potentially powerful technology, 
promising useful application in three general -areas. First, 
combinatorial chemistry may be used to identify molecules with 
highly specific adhesive or enzymatic activities useful in 
biomolecular research, clinical therapy, or diagnostics. 
Second, by screening a library for molecules that interact 
specifically with a known compound of significant interest, 
combinatorial chemistry may assist in identification of the 
known compound's native interactants (e.g. Phizicky et al., 
1995). Third, combinatorial chemistry may be used for epitope 
mapping, serving to elucidate the surface of structural 
interaction between two molecules of interest (e.g. Phizicky 
et al . , 1995) . 

The current combinatorial chemistry of proteins, powerful 
though it may be, has shown two distinct limitations. In many 
libraries, the diverse proteins are presented on a platform 
that fails to confer much structural integrity; as a result, 
the proteins can be too plastic to exhibit a stable shape and 
consistent properties. Equally ^ important, the- screening 
procedure can be labor-intensive, and factors outside of a 
protein's exhibition of the properties of interest often 
influence its isolation by d screen. This occurs in most 
screens reported in the literature 1 ( e . u . Phizicky et al., 
1^95). A related technology/ two-hybrid systems, 1ms proven 
useful in the latter two applications mentioned abovi-, namely, 
interaction studies and epitope mapping (Phizicky et al., 
1995) . However, two hybrid systems piac^ n very strict limit 
■>n V he Kinds • • i : 1 1 1 e i a . • * \ • i i ■ ii.it i;m y bo . . t : id : o< i : o. t r. 
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many of the difficulties encountered by other technologies. 
(The term polypeptide is used herein to comprise all types of 
peptides or proteins, including glycoproteins, proteoglycans, 
lipoproteins, acetylated proteins or (other) processed peptide 
or protein forms). It was a further object to provide a fast 
and reliable screening method for combinatorial or other 
peptide libraries, which is also versatile and broadly 
appl icable . 

These objectives are met by a method for identifying and 
isolating at least one of the following: a polypeptide of 
interest, a target compound or a gene of a polypeptide of 
interest or a gene of a target compound, wherein at least one 
of said polypeptide of interest or said target compound is 
unknown comprising the steps 

a) constructing a cell population by engineering it to 
express a membrane polypeptide which has said 
polypeptide of interest in its extra- or intracellular 
domain, said extra- or intracellular domain being 
exhibited in one or more copies on each cell's surface 
directly or indirectly interacting with a selection 
site specific for a selection agent, such that upon 
binding of said target compound to said polypeptide of 
interest said selection site of cells which exhibit 
such a binding of said target compound to said 
polypeptide of interest is modified in a manner that 
said selection agent may be used for selecting those 
cells which exhibit said binding of said target 
compound to said polypeptide of interest from cells 
which do not exhibit said binding, 

b) exposing said cell population to said target compound 
or a population of said target compounds for a period 
of time sufficient for said target compound to bind to 
said polypeptide of interest, 

c) exposing said cell population simultaneously with or 
after step h) to said selection agent for a period of 
time sufficient for said agent to interact with said 
selection site, 

d) selecting for clones which show said binding of said 
target compound to said polypeptide of interest , 
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e) optionally allowing said selected clones to multiply 
and 

f) isolating said polypeptide of interest and/or said 
target compound and/or said gene of said polypeptide 
of interest and/or said gene of said target compound 

In this invention, a library of polypeptides is exhibited 
in one or more extra- or intracellular domains of a membrane 
polypeptide expressed by a cell population. Selective 
pressures exerted by selection agents eliminate those cells 
whose polypeptide sequences tail to bind to a preselected 
target compound. Thus, from a cell population that may exhibit 

«_* v>* «■ ^ ^> v. j - • £' -~ ■ --- J [ ' f k «~" »— ^ >^ U V. * 1 ^- V- O ^ V_^- -4- _»_ »»14W^J\_ lj ^^U^.ll^CO 

bind the target compound are selected and e.g. amplified by 
growth. The term "polypeptide imprinting" is used to refer to 
the use of the method according to the present invention to 
select cells exhibiting polypeptides that bind the target 
compound or exhibit specific properties. 

With this invention, a wide variety of prokaryotic and 
eukaryotic polypeptides may be imprinted, serving e.g. 
creation 01 discovery of polypeptide sequences with high 
affinity tor a predetermined target, interaction studies, or 
epitope mapping. In more sophisticated embodiments of the 
invention, polypeptides may be imprinted to exhibit the 
following specifi :: [. r ope r t : es : mult ivulcnt binding to a 
predetermined target; high affinity for a target and low 
affinity for one or more molecules closely related to the 
target; inhibitory activity . n a biological pathway 

(antagonism); excitatory activity in ,i binlorjjoa.l pathway 

faqonisrn) ; r. r enr.ymat if c»et ivity. 

It l s a] so possible to screen riot only tor a polypeptide 

Library but also coincident iy or alternatively, tor a 
popu 1 a t i on of target me i ecu 1 ♦ > ; > . 
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This invention is the product of the confluence of three 
distinct areas of research: i. the study of prokaryotic and 
eukaryotic cellular membranes and their complex constituents; 
ii. the study of the evolution of microbial populations under 
the selective pressures of toxins, bacteriocins, or viral 
parasites; and iii . the design and screening of polypeptide 
combinatorial libraries. 

With the present invention, a polypeptide with promising 

properties can lmmeaiateiy ut i^iit^^-i 

isolated, making the present method extremely sensitive. It is 
possible to select one polypeptide that binds to a ligand out 
of a background of 10 6 non specific polypeptides as a minimum, 
but in there is no evidence for an upper^limit on sensitivity 
regarding the present method per se. 

With the present method all cells of the combinatorial 
library of polypeptides and/or target compounds are screened 
at once, rather than serially, providing results in far less 
time than many alternative technologies. The present invention 
is easy to use, because the important parameters are easily, 
adjustable for the man skilled in the art. Once a set of cells 
is selected, the genes for the polypeptides or the target 
compounds (if these are also expressed by the cells) are 

easily isolated. 

The polypeptide of interest may be displayed by any of 
the wide ranges of cell types known and used for displaying 
foreign proteins in extra- or intracellular domains (e.e. on 
the cell surface). This allows the present method to avoid the 
biases of any one expression system. The polypeptide is 
preferably expressed in a membrane protein which is 
biochemically appropriate for the target compound or for the 
system wherein the isolated polypeptide/target pair is 
intended to be applied (e.g. as a vaccine or as a 
therapeutic) . 

Combinatorial chemistry, which is one of the major 
components of the present invention, consists of two or three 
steps, often performed in iterative cycles. The first step is 
the generation of a diverse population of molecules, the so- 
called combinatorial library. In the second step, the 
molecules in the combinatorial library are partitioned 
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according to their exhibition of properties sorrght by 
researchers, such as affinity for a particular compound or 
enzymatic activity. In a third step that is practiced in some 
but not all embodiments of combinatorial chemistry, the 
molecules found to exhibit the desired properties are 
replicated or resynthesJ zed. The cycle may then be repeated, 
beginning either at the first step, with the introduction of 
new variation to the molecular population, or at the second 
step, with the partitioning of the population according to the 
exhibition of desired properties. When a combinatorial 
approach is successful, the cycle of generating molecular 
diversity, whittling down diversity in favor of the few 
molecules that display the desired properties, and, in some 
cases, amplifying the numbers of these molecules, ultimately 
yields molecule:; exhibiting the desired specific activities. 

The immune system is essentially a genetically coded 
combinatorial library, complete with steps of variation, 
selection, and replication: A vast diversity of antibodies is 
displayed on the surface of B-cells (variation), and specific 
recognition of antigen by any given cell triggers that cell's 
proliferation {partitioning and replication) . It is not 
surprising, thou, that the first in vitro gene t i ca L 1 y- coded 
combinatorial Libraries weie directly appi opi 1 a t ed 1 1 om the 
immune system: B-cells were immortalized through hybridization 
t n mouse iiiyvluir,,! lines, (Marks *-t <i i . , i'oM, . Lat*M, in <.>rd>o 
to circumnavigate difficulties of hybridization, and to 
utilize a greater part of the antibody repertoire, E . col i 
were engineered to express and secrete the variable chain:; of 
ant i bodies . How^vei , bei a us secreted ,i n t l 1 )» -d i < ■ s. wee- 
"unt agged" (Jnnd-i, 1 ( > ( .»4 ) vj;:. not phys i ea J 1 y linked to th^it 
genetic codes -■ screens tor antibodies t hcd bound antigen had to 
be performed according to t ochn i qu» ■ s t hat allowed r o sea l '.he r s 
to trace 1 from e a : • h scjc't (d antibody to t h ^ g e n » 1 that encoded 
it. This se ve i >. ■ i y J ■ :< i t • • i : h* ■ is irt.t >• o . • t b a i» - Mi:' : on • a 
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Combinatorial libraries of fusion-phage are composed of 
filamentous bacteriophage or phagemid that have been 
engineered to bear semi-random exogenous proteins as sandwich 
fusions near the end of the adsorption protein pill (Janda, 
1994, Marks et al . , 1991). The method of screening these 
libraries for phage that exhibit affinity for a__target 
compound is known as "affinity purification": the target is 
immobilized on a solid surface or column, over which the phage 
library is washed; pnage bound Lu the target are ciuted, 
amplified by growth in bacterial culture, and then subjected 
to yet another round of partitioning. This cycle is usually 
iterate! until a tractable number of clones are isolated. 
Phage libraries have been used for epitope mapping (Phizicky 
et al., 1995) -viz. pinpointing the region of a larger protein 
that is involved in binding a target compound of interest-as 
well as for deriving novel ligands to targets. Entire single- 
chain variable-fragment antibodies (scFVs) have been inserted 
into phagemid pill (Marks et al . , 1991), and libraries of 
scFVs or semi-random oligopeptides have been screened for . 
binding to a variety of medically significant polypeptides., 
nucleic acids, and molecules not previously known to bind 
proteins . 

However, while fusion phage technology has been 
responsible for significant strides forward in combinatorial 
chemistry - such as the recognition that "tagged" libraries 
would permit screening of a vastly increased number of 
sequences - phage libraries themselves have not escaped 
several besetting difficulties. First, in affinity 
purification of phage libraries, factors besides affinity for 
the target compound contribute significantly to each clone's 
selection. For example, the collection of clones selected from 
a single library by affinity purification can vary 
dramatically with the technique used for elutmg phage from 
the solid substrate oi target compound (Phizicky et al., 
1995). Second, phage adsorption protein, pill, may fail to 
provide exhibited proteins with a sufficiently stable 
platform. Many polypeptides acquire structural rigidity only 
in certain constraining conformations; otherwise they remain 
extremely plastic. Therefore, to be functionally active 
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structures, many polypeptides will require a foundation other 
than pi 1 1 . 

A promising solution to problems with the platform for 
polypeptide display was introduced when a combinatorial 
library of proteins was displayed in an extracellular loop of 
a bacterial outer membrane protein (Brown, 1992). To construct 
this library, between 1 and 20 semi-random oligonucleotides, 
each one 33 bases long, were inserted into the genetic site 
encoding an extracellular loop of the outer membrane porin 
LamB. Like many membrane proteins that have been the subjects 
of topological (Boyd, 1994) and antigen presentation (Georgiou 
et al . , 1997) studies, LamB intercalates the outer membrane 
even when exogenous polypeptide sequences have been inserted 
into one or more of the protein's extracellular domains. Thus, 
when 1 amB genes bearing insertions were used to transform £".. 
cold , each transformant expressed LamB in which one 
extracellular domain exhibited a semi-random, exogenous 
oligopeptide between 11 and 220 amino acids long. 

To screen the library for proteins exhibiting affinity! 
for the target compound iron oxide, the E . coli population was 
placed in a suspension of iron oxide particles, binding was 
allowed to occur, and a magnet was used to recover bacteria 
that huu adhered to metal (Brown, 1992). The selected portion 
of the population was amplified by bacterial growth, and the 
pal t it l < »n i n a pi - )e eou i e w<i s i e j >* -,i I < •< i . A i ■ ♦ m j" < > u i « ■ yc 1 e < > 1 
selection and amplification, fresh variation was introduced 
according to a strategy that would not erase sequence- 
information accrued t tins far. Another loin cycle:- or selection 
and amplification then yielded four clone:;, three oi which 
bore a consensus sequence for binding i i on oxide. 

While the combinatorial library in LamB introduced the 
p r om i s i n q r >< > s s i b i 1 it y o f i i s i no I >a c t * • ) l d 1 Mate. 1 : memb r a f i e 
proteins as platforms for presenting libraries of diverse 

p- 1 J- yt lf M - I ides , • ie :i-ie':t lfll p r .»■■'♦.{ iii j . , ,ippj ici t . , ' lie j lhM| y 
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offers a user- friendly, efficient, and precise method of 
selection, capable of screening libraries of all sorts of 
polypeptides presented on the platform of a membrane 
polypeptide . 

The membranes bounding any prokaryotic or eukaryotic 
cell referred to here as the outer or inner membrane, present 
an extremely complex surface to the extra- or intracellular 
environment. These molecules - referred to here as memorane 
polypeptides - exnioii <x v^ol - — 

conformational, chemical, and electrical properties. Surveyed 
elsewhere in greater detail (Singer, 1990), the membrane 
pniwppptides are here categorized into several structural 
families The conformational and functional differences 
between these families are relevant to this invention because 
membrane polypeptides provide the underlying structures that 
are used to display exogenous sequences, and are selected for 
interaction with the target compound. Moreover, this invention 
often capitalizes on the intrinsic properties of membrane 
polypeptides by imprinting them to exhibit properties or 
perform functions that are similar to their natural ones. 
Finally, the delimation of structural families presented here 
should also provide a profile of the vast range of properties 
exhibited by polypeptides on the cell membranes, for it is in 
part this range that makes the capabilities of this invention 
so broaci. 

Single-pass transmembrane polypeptides (reviewed in 
Singer, 1990), also called bitopic polypeptides, have one 
intra- and one extra-cellular domain, and a midsection that 
passes through the membrane. The terminal domains fold into 
tertiary structures, while the transmembrane section often 
forms a hydropathic a-helix, in which no more than 2 of the 20 
transmembrane residues may be ionic. Singer (Singer, 1990) 
divides bitopic polypeptides into types la, lb, and II (figure 
1) Types la and lb exhibit the amino terminus on the 
extracellular s^de of the outer membrane. Distinguishing type 
la is an ammo terminal signal sequence that directs export to 
the membrane, and is cleaved before final membrane insertion. 
Type la polypeptides are ubiquitous, and in eukaryotes they 
are the predominant type. Because the transmembrane sequence 
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of type la polypeptides is typically closer to --the carboxyl 
terminus, the extracellular domain is often large. 
Consequently, type la polypeptides are competent for functions 
involving high-affinity adhesion or highly specific binding to 
especially extracellular targets. Among type la polypeptides 
are receptors for growth factors and for peptide hormones; 
immune system antigen recognition molecules; and molecules 
responsible for cell adhesion (Singer, 1990, Staunton et al . , 
1990). Proclivity for adhesion or highly specific binding to 
extracellular targets make type la polypeptides suitable for 
use in many embodiments of this invention, especially those in 
which a polypeptide is imprinted to bind to an extracellular 
targe t . 

Typo lb polypeptides do not: have a cleavable signal 
sequence. The transmembrane stretch is typically close to the 
amino terminus, and therefore there are few extracellular 
residues (Singer, 1990). This makes type lb polypeptides 
suitable for sophisticated cytoplasmic functions, and 
interactions, but poorly suited for interactions with targets 
in the extracellular medium. 

Type IT bitopic polypeptides are oriented with their 
carboxyl t orminus on the exterior side of the membrane (figure 
1). As in type la bitopic polypeptides, the bulk ol type 11 
molecules is usually on the exterior of the cell. Type II 
polypeptide's are typically enzyme:* with ex t racei i u In i aci i 
sites, such as glycosyl transferases, or highly specific 
extt a c e 1 1 u 1 a i tece p t o r s , such a s 1 1 i e a s i a 1 o q 1 y c o p r o tein 
receptor (Singer, 190O) . <."omnot en re for ext. race! hilar 
enzymat ic or rccopt oi act lvity make:; t ypc 11 bitopic 
polypeptides suitable toi use in certain embodiments oi this 
invention, in which extracellular domains are imprinted to 
hi n d o r e i \ z ym a t i c a 1 1 y m o d i f y f a r q e t s in the e x t i a eel 1 u 1 a i 
med i um . 

Mu i t i pas ; t 2 . insmfMnb r in* ■ i • • • i /pep* i i-, i i . - v o -w* ■ i in : * i uge : , 
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to expose either its end or yet another loop leading • to more 
transmembrane passes (figure 1)- Although referred to and 
often illustrated as "a loop", each exposed region of a 
multipass polypeptide exhibits not only a unique sequence of 
amino acids, but also a distinctive tertiary structure. Like 
single-pass transmembrane polypeptides, multipass ones are 
suspended in the lipid bilayer by the hydrophobici ty of 
transmembrane strands. However, in multipass" polypeptides, 

t ; . _ ■; _,v, i , , <rt- Sn+-o a — helices, because 

these stranas au not x>,va^ -a.^ 2 ~»~~~ 

when multiple strands neighbor one another, they can adopt an 
alternative strategy for hydrogen bonding between peptide 
bonds: if each strand adopts the secondary structure of a P - 
strand, these strands can curl together into a closed 
cylinder, allowing hydrogen bonds to form between the peptide 
bonds on separate strands. The resultant tertiary structure, is 
called a p-barrel (figure 1). It is extremely stable, and can 
be released from the outer membrane without denaturing 
tertiary structure. In several preferred embodiment of this 
invention, the extremely stable, polytopic p-barrel OmpA is 
imprinted to bind to various targets. 

Polytopic polypeptides that traverse the membrane in-a- 
helical conformation often function as specific receptors, for 
messengers or hormones, such as yeast mating pheromone and 
many others. These and other a-helical polytopic receptors are 
categorized by Singer (Singer, 1990) as type III polypeptides. 
Their capacity to bind target molecules specifically, and 
sometimes multivalent^, makes a-helical polytopic receptors 
ideal for use in many embodiments of this invention. Moreover, 
selection systems according to the present invention are easy 
to construct when the membrane polypeptides involved in the 
selection are polytopic. 

Many prokaryotic and eukaryotic cell-surface polypeptides 
are fixed to the lipid bilayer not by a hydrophobic- 
transmembrane strand, but rather by a glycolipid anchor 
(reviewed in Cross, 1990) (figure 1). An anchor consists of a 
lipid molecule, such as dimyristyl glycerol or alkylacyl 
glycerol, adorned by a mul t i - res i due sugar complex, which is, 
in turn, covalently linked tc the membrane polypeptide. Among 
eukaryotes, two signaling elements of a protein direct the 
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addition of a glycol ipid anchor: a we] 1 -de f inech ami no terminal 
signal sequence, which enables transport, into the lumen of the 
endoplasmic reticulum, and a carboxy terminal qlycolipid- 
addition sequence, which is not precisely defined but 
generally consists of a short hydrophobic domain located 10-12 
residues carboxy terminal to a pair of small residues (Cross, 
1990). Once appended to the carboxy terminus of a protein, the 
glycolipid anchor itself can direct polypeptide insertion in a 
targeted region of the outer membrane. 

Lacking an intracellular domain, g 1 y c o 1 i p i d - a r i c h o r e d 
outer membrane protein?.; cannot, perform eytosolic or 
per i pi a snu c fund ions. InrA.ec^i, they provide protect ion from 
the extracellular environment, or function as enzymes or 
adhesins recognizing exclusi vel y extracellular substrates . At 
least six leukocyte surface proteins, many other eukaryotic, 
proteins belonging to the immunoglobulin superfamily (Cross, 
1990), and tin: trypanosome variable surface glycoprotein (the 
protective protein that the trypanosome sequentially alters to 
avoid detection by the host immune system) are anchored by., 
glycol ipids. These examples illustrate the characteristics of 
glycol i pid-an< 'ho rod prot e i ns t ha t a re impor taut t o thei r use 
in embodiment: of this; invention: capacity for highly specific 
adhesion and plasticity that, allows vai iation of the anchored 
protein without disturbing a robust system tor export to and 
insert ion ; n Mio -. m T e r io -n.t > t riiic . 

Among oukaryotic cells, the diverse array of surface- 
exposed prot e lies ~i s adorned by yet a not. tier tier o 1 complex 
molecules with dist met iv« propoit ies, namely, the saochai ides 
(reviewed in Lis 1 1 a 1 . , 1 9 * M ) . The > J l g o s a e e h a r i d o ; ador n l ng 
q.l ycof >rr d <• i ns a^s-oally consist <d l>\ss than If. residues. 
Howevei , owing to t )\^ (diversity of residues and eovalent 
1 inkaijcs^ t licit permit assembly into blanched as well as linear 
structures, o 1 i gosa eohn i l d* v ; exh.ilu ! t [lousands r > 1 lis! iiv.l 
s t r i t u r t ■. . . Sh- f >< : i y . i e- > i , o eC:. c , , , i ( n , , j m , , ]r .\ , > , , . ,. . 
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receives distinct saccharide chains when expressed in 
different cells, and separate copies of a polypeptide 
expressed in a single cell type can have different sugars 
attached at a given site. Because sugar residues are 
sequentially appended, each by a particular enzyme to which 
the protein is exposed in the course of the processes of 
translation, folding, and export to the outer membrane, the 
glycosylate at a site is largely a product of the extent and 

r . v _. ; i ^ ^y^oom-o t-o Parh en z vme . Exposure of a 
duration oi Liitr o^^o ^ ^ a ^ ~ ~ 

site is, m turn, affected by the way a protein folds and the 
rate at which it moves through the steps from translation to 
insertion. Consequently, the gl ycosylat ion at a site can be 
altered in unpredictable ways by changes^to the local sequence 
or to the sequence elsewhere in the polypeptide (Lis et al., 
1993) . 

Diversity of molecular structure and relatively complete 
exposure to the extracellular environment make the saccharides 
appended to proteins effective receptors of specific 
substrates outside the cell. Glycoproteins and proteoglycans 
are involved in the intercellular recognition and adhesion 
underpinning sperm-egg interactions, blood clotting, and 
inflammation (Lis et al., 1993, Singer, 1990). The roles they 
play, via adhesion, in these important processes make the 
glycoproteins and proteoglycans ideal for use in certain 
embodiments of this invention. 

Therefore, in a preferred embodiment of the present 
invention, the membrane protein is an ot-helical polytopic 
receptor, a glycolipid-anchor ed polypeptide, a polytopic 
integral membrane polypeptide, a membrane-bound isoform of a 
secreted polypeptide, a bitopic integral membrane polypeptide, 
a glycoprotein or a proteoglycan. 

It is significant, to some embodiments of this invention 
that insertion of polypeptides into the membrane and 
translocation of secreted polypeptides across the membrane are 
oerformed by the same cellular machinery (Singer, 1990). Among 
eukaryotes, integral membrane polypeptides inserted into the 
endoplasmic reticulum (ER) have the same cleavable or 
noncleavable ammo terminal signal sequences as completely 
translocated polyeptides. This is consistent with the fact 



RN^DOCID <WO 99301 SI A 1 I ^ 



WO 99/30151 



PCTYEP98/07922 



-13- 

that insertion and secretion depend on the same-cytoplasmic 
siqnal recognition protein (SKP), which transfers both 
inserted and secreted polypeptides to the SRP receptor on the 
ER membrane. Similarly, among bacteria, membrane insertion and 
trans 1 oca t i on seem to require the same gene products, namely 
secA and secY (Singer, 1990). Among prokaryotes and eukaryotes 
alike, there are many examples of secreted anci membrane-bound 
polypeptide i so forms. Often the only difference between the 
isoforms is the existence of a hydrophobic stretch of residues 
or a glycol ipid attachment site near the carboxy terminus of 
the membr a in-- bound polypeptide (Singer, 1990) . 

The process of type III polypeptide intercalation into 
the membrane also has implications that, are significant to 
this invention. The cellular machinery that inserts bitopic 
polypeptides and translocates secreted ones is also 
responsible for intercalating a-helical multipass polypeptides 
into the outer membrane. This is clear from studies that show 
that i) type III polypeptides sometimes have the same signal 
sequences as bitopic and secreted polypeptides, and ii) .. 
membrane intercalation of type III polyeptides requires the 
same gene products as insertion of bitopic polypeptides. 
Sever a 1 expe r i men t a 1 app roa^hos , i nc ! udi ng gene f us ion , have 
shown th.it the cellular machinery tor inserting polypeptides 
in the outer membrane intercalates most typn Til polypeptides 
bv t : .Hi.: 1 ■ . \ ; t i :ig >:s - h '/■. i r •» >y i. : \ l ■ s-S i . i • 1 1 u A , 1 1 iu(.|j ,c -i t nn*-- . 
The p o 1 y p e { > t i d e is t h u s int. e r c a 1 a t e d a s a s e r i e s o f 
independent ]y insertoi segments (Singer, 1990) . 

Most ( ml >od i men t . i.i this invention t oqu l r o a t t >p- > 1 ■■ u c« j 1 
map of Mi- imprinted po 1 ypep t i d* • , arid a technique He 
: n t i c > d ; : . ■ l a a 1 1 • ■ w . . * mji ; • • n c e s to s. p » 1 c i I i < - d e x t i « i c .: t 1 1 u L 1 i r d < m n 
ot the imprinted polypeptide. Roth ol t h<>so needs ate met by 
u ( m i e 1 u s i • r ) t (v/hni q \ i e : ; . 

A vad variety < d po j ypop t J do : : , ^:-poi.il ! y t h- 
p.- •. 1 v p p * e; i . : r » • v i i ab«. . v, h . : v » ■ o km-i y i ■ • 1 • n 1 ^ • p« ■ ! t ■ S J * - 1 i i V 
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the membrane polypept.de bore the reporter moiety as a mid- 
sequence insertion or as a C-terminal extension. Provided the 

o e Z -iety disrupted neither sorting of the Polypeptide to 
the appropriate membrane, nor insertion of the polypeptxde 
into the membrane, the reporter was located in the same way 
as the immediately adjacent region of the membrane 
polypeptide. This region's location on the exterior or 
interior of the membrane was then inferred from the reporter s 
' , ... .u_ e ^ ThP Pntire polypeptide's topology 

activity, 01 j-dof. -"^ e - h ,.u HH 

was deduced ,y assaying reporter activity in a set of hybrrd 
polypeptides, each of which bore the reporter at a different 

,»„ in the »e*r. M polypeptide's sequence (these ,e»e- 
tusion .app.n, techniques are teviewedin Boyd 199 h Nor un., 
has this approach provided topological oaps of most of the 
polypeptides t.-»»lV used in gene technology, especially the 
pro«ins described above; it can also be used to determine the 
topology of any membrane polypeptide that has not yet been 
mapped! but is a candrdate for imprinting by this rnventron. 

in addition to a topological map of the imprrnted , 
polypeptide, each embodiment of this invention also requires 
technique for inserting exogenous peptide sequences into one 
or more extra- or intracellular sites in the imprinted 
polypeptide. Alternatively, exogenous peptide sequences may 
replace exrs t rngsect ions of the extra- or intracellular 

domains of membrane. huhrid 
Because gene-fusion topology studies used a set of hybrid 
polypeptides, each bearing the reporter in a region whose 
location relative to the membrane was to be ™ er 

techniques were developed to enable insertion of the epotter 
gene at precisely specified sites in the membrane protein 
gene. These techniques include the foiling: 1. the use o 
vectors with restriction sites that occur at unique locations 
ln the membrane protein gene; if. an oligonucleotide^! reeled 
deletion mutagenesis method; and iii. polymerase chain 

methods (Boyd, 199,,. Each of these techniques can 

used in this invention to insert exogenous sequences into a 
specified Site in the imprinted polypeptide; an «"n«on^f 
these techniques, which is used in this invention to introduce 
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new variation to extra- or intracellular domains, is described 
bel ow . 

Beside:, providing topological maps and techniques for 
insert i nq exoqenous pept ide sequences , gene- f us i on studies 
have collectively demonstrated that introduction of an 
exoqenous sequence of peptides to an extra- or intracellular 
domain ol a membrane polypeptide does not interfere with the 
sorting of the polypeptide to its appropriate membrane, or 
with the polypeptide's normal insertion into the membrane. 
This capacity of extra- or intracellular domains to 
accommodate foreign oligopeptide-: 1 , is critical to the 
const met 1 or: o t comb] nat or 1 a 1 1 i brar i es o f exogenous 
oligopeptides in the extra- or intracellular domains of 
membrane polypeptides. Tins rapacity is also, therefore, 
critical to certain embodiments of this invention. It has been 
observed in various bacterial, yeast, and mammalian 
polypeptides, including the examples described in the present 
speci f icat i on . 

An extracellular domain of lamB, an outer membrane porin 
of E. coli, has been shown to accept exogenous polypeptides up 
to 16 r - amino acids in length without disturbing normal sorting 
or insert o n of \ .ho protein (Brow::, l'". ,c >?) . Foreign sequence: - ; 
havo iil:;n h-on inseited in; o ^xo aceiluLn domains of E . coli 
PhoF ( phospha t e- 1 imi t a t i on- i nduc i bl e outor membrane pore) , and 
FhuA ( 1 e i r l • "h i ■. 4 uptake recent o; j , a, id t lie \ \ i. o-m i i ui is had no 
effect on protein sorting or insertion (Georgiou et al.,1997) . 
In the specific case of E. col a ornpA, one of the most abundant 
polyp»'pt id**;, ui the bti-d ri nil on' cr nv.mbran' (Mm dim et til., 
1 9 P ' . \ , o .1 i g«. ■ n i i t " 1 o o t i de : rich in restriction r- i . < i o n 1 1 c 1 e < i s t ; site; 
we if inse::*-ti jut<> the , >A qene at sites corresponding t i > 
protein !i)i)p:; II and IV, rind, the ait e r ( 1 d \ > r i > t e i n w , i s e x f j t e s s ed 
and exhibit ed normally (Kreudl, l ( )B r 3) . Subsequent iy, i i i J iuenza 
ha ema a 1 u t i iu n , bet a - u< i 1 a c t o c: i da ( • t { < a and nu >u t h d i sea so 
' * i ? • ! p r ! e : n • ( M-1 i V I ' ! - , i:t-sl-= - i » c j * o i . ■ : : : . , . j 1 1 • i • ■ v e : . i ! 
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antigens were shown to elicit intense virus-specific, immune 
response in rabbits (Georgiou et al . , 1997). Second, Cex 
exoglucanase displayed on an extracellular domain of OmpA 
hydrolyzed cellulose as effectively as conventionally 
immobilized enzymes (Francisco et al-, 1993). 

in yeast, extracellular domains of a number of membrane 
proteins accept exogenous peptide sequences without exhibiting 
disturbance to normal sorting and insertion (Boyd, 1994, Boder 

t-ir membrane polvpeptiaes 

and Wittrup, is^'i • nxyi.c- _~ j- - 

. . . r __-4-.. ~ p^r-o D t- foreign inserts at 
have also exhibited tne Ldpa.xv., to a^~-P~ j 

extracellular domains. When a polypeptide is expressed m 
p„V,rvotic cells, gene fusion studies may insert extra 
glycosylate sites to function as reporters: only polypeptide 
domains on the luminal side of the ER are exposed to 
glycosylating enzymes, and luminal domains become 
extracellular domains. Extra glycosylate sites were inserted 
into extramembranous regions of mouse muscle nicotinic 
acetylcholine receptor: the inserted sites did not disrupt 
normal sorting or insertion of the outer membrane polypeptide; 
and, sites inserted in extracellular receptor domains were in 
fact glycosylated (Chavez et al . , 1991). 

in summary, gene fusion mapping techniques contribute to 
this invention in four ways. First, these techniques have been 
widely used to determine the topology of countless outer 
membrane polypeptides, many of which are described above and 
may be imprinted by this invention. Second, these techniques 
may be used to provide topological maps for polypeptides that 
have not yet been mapped, but may be candidates for imprinting 
bv this invention. Third, gene fusion techniques provide this 
invention with commonly practiced methods for inserting an 
exogenous peptide sequence into a specified region of a 
membrane polypeptide; the use of these techniques in the 
invention, as well as the straightforward extension that 
enable- the insertion of variable sequences, is described 
»~iow Fourth, aene fusion topology studies have demonstrated 
that the introduction of an exogenous sequence to an extra- or 
intracellular domain of many mapped polypeptides does not 
interfere with the polypeptides' export to or insertion in the 
membrane . 
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Certain embodiments of this invention involve alteration 
of a polypeptide's mode of attachment to the outer or inner 
membrane. Because insertion into the membrane and 
translocation across the membrane are handled by the same 
cellular machinery, secreted, transmembrane , and qlyco lipid 
anchored isomorphs of a single polypeptide are. often 
interconvertible . Moreove r , secreted and membrane-bound 
isomorphs can use the same amino terminal signal sequence, so 
in many cases, the only alteration required for conversion of 
a polypeptide to its isomorph is a chanae at the pol ypept ide 1 s 
carboxy terminal. Attachment of a 20 residue hydrophobic 
sequence to the carboxy terminal of a secreted polypeptide is 
often sufficient to construct an isomorphic polypeptide that 
is anchored by a t ransmembi ane span; this technique for 
converting a secreted polypeptide into a membrane-bound one as 
described in the literature (Singer, 1990). Alternatively, 
transmembrane or secreted polypeptides can be converted to 
glycolipid anchored variants by addition of carboxy terminal 
signal sequences from the precursors of glycolipid anchored, 
polypeptides; this technique is also described in the 
literature (Cross, 1990). In some cases, a glycolipid anchor 
not only attaches an exogenous polypeptide.' to a cell membrane, 
but rilso directs sorting el the anchored alien polypeptide. In 
such c^s^s, a single' glycolipid anchor sequence is all that is 
required to d J r < • ~ f an !-xog f -ne«us p'dypep! ide to the membrane, 
translocate it, and anchor it (Cross, 1990). 

In certain embodiments oi this invention, two or more 
outer oi inner momb r a a« ■ polypeptides ate 1 used to construct u 
chimeric mult ip,;ss polypeptide comprising extra- and/or 
i nt r rice l I u bi i doma l ns or bo t h oons t i t iiont pol ypept i dc?s . Tin:- 
ma n i p u I < i 1 ion has been sue« e s f ■ i 1 1 y pe r tui rued with a variety of 
prokaryot ic and eukaryot. i e pol ypept .ides . Feu example, in V, . 
oej / ompK-ompc chimeric piotoin U>'M (Kourel et al., 
the r eiji on •: c < iiitf • V < >irp r o : s • : cxt jeieel iiil.ii 1 imp: ■ 1 through 
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chimeric polypeptides, novel transmembrane polypeptides were 
created as follows. A region of the human asialogl ycoprotein 
receptor HI, which included the polypeptide's transmembrane 
domain (20 hydrophobic residues, 40-61 in the wild-type 
protein), was tandemly repeated two, three, or four times. In 
each case the HI repeats were joined by relatively hydrophilic 
linkers, such as the hexapeptide HRPSLG. These constructs were 
normally inserted into membrane, and luminal domains were 
normally glycosylated. 

The general principle that often makes fusion of membrane 
proteins possible has already been outlined: the above 
description of the process of membrane protein insertion 
emphasizes that a-helical transmembrane strands of polytopic 
membrane proteins are inserted sequentially and independently 
of one another. An implication of this mechanism of 
intercalation of transmembrane strands is that almost any 
chimeric polypeptide that is constructed of an ammo- terminal 
signal sequence followed by multiple @20 - residue-long 
hydrophobic stretches, alternating with stretches of more „ 
hydrophilic residues, can be normally inserted, provided each 
hydrophobic stretch is independently stable in the membrane. 

In a preferred embodiment of the (resent invention, the 
extra- or intracellular domain of the membrane polypeptide 
contains a highly variable region, which is, upon binding to a 
tarqet compound, identified and isolated as the polypeptide of 
interest or a section thereof. With the gene technology 
methods described above, in combination with combinatorial 
genetics, the skilled man in the art can make a library of 
cells displaying a huge variety of variable regions, which can 
be identified and isolated with the present matter in a very 
short time. It is possible to provide the gene for the 
polypeptide of interest simultaneously with the polypeptide of 
interest. The highly variable domain may comprise polypeptides 
encoded by a cDNA library or sections thereof. 

The membrane polypeptide can be a homologous protein for 
the cell (adapted for the present method) or - which is 
preferred - a heterologous polypeptide tor the host cells. 
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Preferably, the membrane polypeptide is ge^t ically 
engineered to contain the variable domain and the .selection 
site residing adjacent to each other on the cellular surface. 

The target compound can be any material, e.g. chemically 
produced substances, proteins, nucleic acids, complex product? 
derived from biological sources or mixtures or .(.combinatorial) 
libraries of these materials. Preferred substance classes for 
the target compound are known drugs (or derivative:; thereof) 
(e.g. t.o investigate further binding patterns of such 
compounds to polypeptides), polypeptides (to identify and 
isolate prot ein/protein interactions) , antibodies or antigens, 
vaccines or pathogens, etc.. 

If the present invention is performed using an 
intracellular domain of the membrane proTei n, i.e. the 
polypeptide of interest is displayed at the; inner surface 
(including the periplasms), it is essential for the target 
compound to be present inside the cell. This can be achieved 
by transportation or diffusion of the target compound from 
outside into the cell (which is, however, only possible with a 
few or small target compounds. An alternative way of bringing 
also the target compound into the cell is by coexpiession of 
the polypeptide of interest and ttw target compound in the 
same cell. Systems to enable such a coexpression ji two (or 
more) exogenous proteins in one and the same cell are known in 
the a i t . I; is clear thai this way <>\ pe r I o r itu n g r he invention 
is restricted to target compounds that can be produced by the 
host ceil, e.g. proteins or nucleic acids. 

Km select! >n or t h" interact ion between target compound 
and polypeptide C inteie.-i, any suitable selection system 
known in the ait may be adapted to t h< method according to the 
present invention. These selection systems are known t.o the 1 
skilled man in trie art in principle, the* adaption of any 
suitable select ion system to the system according to the 
p i " s e n ♦ 1 n v ♦ ■ 1 1 *, i o i ; will b ♦ ■ e x p i ; i l n e ( i [ > e b ■. w . 
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compound results in a signal which influences the status of 
the selection site. This influence may be a direct or an 
indirect interaction of the membrane polypeptide or the bound 
target compound with the selection site. A direct interaction 
may e g- be performed by a sterical hindrance of the selection 
site upon binding of the target molecule, i.e.. the site, where 
the selection agent should bind, is not accessible anymore, 
because of the physical presence of the target molecule. 

ht= i nduced bv a 

Alternatively, sucn a aucxx^ox - . 

conformational change of the membrane polypeptide upon binding 
of the target compound to the polypeptide of interest-portion 
of the membrane polypeptide. Preferably, the selection site is 
present in the vicinity of the membrane polypeptide, e.g. by 
covalent bond to the membrane polypeptide. The selection site 
can also be a part of the membrane polypeptide itself. In such 
a case, the sterical properties of the system are clearly 
defined and easily adjustable by common protein engineering. 

Therefore, a preferred embodiment of the present 
invention is a method, wherein the interaction between the , 
extra- or intracellular domain with the polypeptide of 
interest and the selection site is a sterical interaction, 
especially a sterical interaction which does not allow the 
selection agent to interact with the selection site after 
binding of the target compound to the polypeptide of interest. 
Preferably, the selection site is immediately adjacent 
(especially via covalent bonds) to the extra- or intracellular 
domain of the polypeptide of interest. 

In a preferred embodiment oi the present invention, the 
selection site is an adhesion domain tor the selection agent, 
e.g. an adhesion domain which i, also part o£ the membrane 
polypeptide displaying the polypeptide of interest. 

In another preferred embodiment, the membrane protein 
exhibits an additional domain or. the surface of the cell, 
which additional domain, is adjacent to both, the variable 
domain and the selection site and this additional domain also 
interacts, preferably by binding, with tin, target compound. 

Indirect interaction can be performed e.g. by messenger 
molecules, which are released upon binding and induces a 
change of the selection site by binding or by contact. 
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It is essential that the change in selection specificity 
of the cell is in direct correlation to the binding of the 
target compound to the polypeptide of interest, i.e. the 
binding immediately induces the (direct or indirect) 
interaction with the selection site. 

The mode of selection in the method according to the 
present invention is not critical, positive and/or negative 
selection protocols are applicable. In a preferred embodiment, 
the selection agent is a toxic agent.. 

Many of the polypeptides of the 1 (outer) membrane are 
port:; of adhesion for an array of toxic agent, s that kill or 
retard multiplication of the cells they enter. For the purpose 
of the present invention, the term "toxic agent." denotes any 
agent that impairs or inhibits growth oF~the cells. Most of 
the classical selection systems applied in molecular bio logy, 
fall under this type of selection system. Due to its nature, 
this selection system is preferably applied in the 
^extracellular" embodiment of the present invent i on, wherein 
the polypeptide of interest is located in an extracellular 
domain of the membrane polypeptide. 

Many to x 1 c aqents r e coqn i '/. e h s p e c i f i c r o q 1 o n ( t h o 
adhesion domain) of a specific: polypeptide ft ho- receptor) on 
the surface of a specific cell type (tin- host ) . Binding by the 
agent to the adhesion domain is the first step in a process 
that culminates i r i cell lysis : • i : : 1 h 1 1 i t i ■ > : . , t s o i ! growth a i i c i 
multiplication. 11 this first step is impeded by mutations in 
the adhesion domain, by repression of the receptor, or by the 
interference of an inhibitor ore hid ma ?n<* adhesion ■ !< >ma i n , a 
se .1 eot i on a gen t 1 s mv.i s ion o i t cell is. blocked. When <i 1 1 o t 
"m agent ':. possible rco'i'tui iiiohcules on a ;vi 1 die, lor 
what e vol reason, i iuicoes s lbl e m ay s 1 u i ic t o .-n , i i b_o agent 
entry, the cell is rendered immune. When some but nor all 
receptors are inaccessible or dysiunet rona 1 , the- cell may 
become resistant but u<>> immure : r • • in-' ; o i ■ J i h- numbti •. . I 
r f •« '* pt . • » ' : \\\. r: . \ v: t * ■ . » r 1 1 » •.• * . • . • • • ; • '• 
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regard to specificity and mode of actron on the-cell. 
example, pathogenic agents like bacteriocrns and 
bacteriophage* - for which the above mentroned pathway via an 
adhesron do.a.n is quite »»n - can be altered such that 
they change their preference for a specrfrc adhesron domarn. 
Thus it is easily possible for the man sxiHed.in the art to 
design strategies to male pathogens adhere to a great varrety 

of adhesron ^marns ^ lnve ntron, the 

in a pittcnc^ - — ~ 

• tr-aer — agent, such as a bacteriocin, a 

selection agent ib <= H atr,vqen_ d j 

^ r -j r~,xin or virus of an eukaryotic cell, 
hnrf er lophaqe or a LoXiii 

Bactertorins are bacterially produced proterns that are 
harmless to the strains that express and secrete them, but 
"a a to othe,, often cfosefy reiated, bacteria (barter rocrns 
are ieviewed in Pattus e. a!., 1990,. The most extensively 
studred of the bacteriocxns are the colicrns, whrch are 
pressed by ,. coir, and are encoded by piasmrds that aiso 
code for the proteins that confer colicin immunity upon 
colicln-producing cells. The colrcins are composed of three, 
structural domains, each of which is responsible for one of 

: consecutive steps of colicin activrty: i recognr ton of 
an adhesion domain, 11. translocation across th« , eel Xul.r 

. nH , lt .~p.ll destruction (Pattus et al . , 199U). 
membrane; and in. '-en 

The colicins kill by two principal mechanisms, i. the 
formation of a pore in the cytoplasmic membrane, resulting in 
membrane penalization an. de-energization (colicin ' 
B, 1., Ib. and K ) ; ii. en,ymatic cleavage of DNA -IBS 

! pnr (colicins E?. and E3) (Pattus et al., 1990). 
ribosomal RNA (colicins rnlirin3 
emetics of cell death when bacteria are exposed to colicin, 
♦ \ h t th , toxic molecules enter the cell in discrete, 

1^ consist c, a number of active molecules imported in 

::,:: e ::i: step * . . ^o.^ 

here is that a single dose kills a single cell. 

Almost every major outer membrane protein ot t 

frr -,t least one colicin. When multiple 
the receptor for at iea-i 

:rr^r. l £>:. .illustrated 1B 
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figure 2, OmpF exhibits seven extra-cellular loops, LI through 
L7, where LI is amino proximal and L7 is carboxy proximal. The 
adhesion domain of colic.in N includes LI, L3, L4, Lb, and L6, 
hut not LP- or 1,7. Thus, alterations in the peptide sequence oi 
LI result in total immunity to colicin N, wherea? alterations 
to L2 do not confer even slight resistance. The adhesion 
domain of colicin A, another colicin that recognizes ompF, is 
somewhat different. Loops 1 through 5 are required tor 
colicinA to bind its receptor, while Lo and L7 are superfluous 
(Fourel et al . , 1 990). The chimeric; ompF-ompC protein 3601, 
which wa.s described above, comprises loops 1-5 from ompF and 
loop::- (j and 7 f rom the dist inct ompC . As would be expected, E. 
roll expressing 3601 are totally susceptible to colicin 

A, but totally immune to colicin N. 

The number of receptors per cell is important for an ea,sy 
handling of this invention's selection of specific cells from 
a highly variable population. The functional receptors of 
col icins are generally present on the surface of the cell in 
thousands of copies. However, it is important for an optimal 
performance of this invention that the level of expression of 
n i o t ; t r o c op: or s i : . e < i s i 1 / rn a r i ] } ■> u 1 . 1 1 o d : the gene encotiing t ho 
r r .(--<;:. p t- n r h :.' ,i ] ri\"iciy been, or may ea: i 1 y bo, cloned into a 
p 1 a -;m i d with an l nduc i. b 1 e hvt e r o 1 ogous f u.omo ter , pe r mi 1 1 i ng 
external regulation of expression. Protocols for the 
c-: 1 n i ■.]■•"* i ■ / 1 . cin i ma n i p u 1 ci t i on o I piasnn os with i ndue i b 1 e 
prom'iters, permitting regulation of expression, are described 
in the literature (Brown, 199.,-, Freud], l r *89). Protocols tor 
the ■■ - o • i . ; t r u ( - * l o i ] and mani pul at ion i C p o i s m l d s wi th inducible 
pr on. >t p » m m i t t i n ■ j r - • u u I . 1 1 l < > n ■ d » x f ■■ i > 1 : . s i o n <>r < i v i i i . t b b • 

! . • t 1 1 o . . • • pnoi coo i : ; t h " art . 

H . i .• - 1 e r l <> p h a g e s are a iihi(|ii i t ou:; and oxt i aoroi nar i I y 
di verso gtoup oi v i i a i parasitica of bacteria. Ther o are twelve 
families - >\ p h « i g * o c cm j > i i s l n g tSfM) di :;t jnct i s o 1 a t » ■ s ■. . t k n caw ; : 
iv •• : ph» .■ P j y , e.i : 1 1 1 * * ■■ t :nu - >v< r ]n<) baotci i.ii u«Mic:a. 'Hwo: 
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Extensive reviews of morphology, genetics, ecology, and life- 
cycle are available in the literature (Calendar, 1988). 

Very much like bacteriocins , phages bind to specific 
adhesion domains of receptors on the host surface. A single 
outer membrane protein may be the receptor for multiple phage 
species, whose adhesion domains overlap to varying extents. 
Fiqure 3 shows the adhesion domains of various phages, as well 
as'one of the colicins, that utilize E. col! outer membrane 

a 1 «?o shows the 
protein A as tneir receptu,. u.c _g~- --• 

receptor domains that are manifestly not utilized by the 
phages: alterations to these regions of the receptor do not 
,f fe n phHflR adsorption, and thus do not confer phage- 
resistance to the bacterial cell. Phage_receptors, like those 
of bacteriocins, are often present in 100s to 1000s of copies 
per cell, but this number can be manipulated through use of a 
plasmid-encoded receptor regulated by an inducible 
heterologous promoter. As was stressed above, regulation of 
the level of expression of the imprinted polypeptide is 
important to this invention. Protocols for the construction^ 
and manipulation of plasmids with inducible promoters are. 
described in the literature. . _ 

Following binding to the receptor, the viral nucleic- acid 
enters the bacterial host, and phage multiplication commences. 
The strategies for multiplication, and the effects of 
multiplication upon the host, vary widely across phage 
species, immediately following infection, some but not all 
species ot phage confer immunity to subsequent infections by 
other individuals of the same phage species. If the infecting 
phaqe is a virulent one, it quickly modifies normal bacterial 
syntheses in order to appropriate them to its own 
multiplication and release into the extracellular space. 
Distinct strategies for release have very different effects 
upon the host. The mov.r^ae (a family of filamentous phages) 
are liberated by extrusion: new phage are secreted through the 
k,^4,i m ^brane (Calendar, 1988). While this process is not 
lethal to the bacterium, it is metabol ical 1 y taxing, and 
therefore infected bacteria divide at a slower rate than 
uninfected bacteria. Transmission also takes place vertically, 
as dauqhter cells inherit the infection from their parents. A 
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strategy for release similar to extrusion is exhibited by the 
PI asmavi ridae (a family of pleomorphic phages). These phages, 
which consist only of a double strandeci DNA molecule 
surrounded by a lipoprotein envelope, are liberated from the 
host by budding from the cellular membrane. Again, the process 
is not lethal but retards multiplication. By contrast, lysis, 
a means of release for tailed and cubic phages, destroys the 
host. After intracellular phage multiply to numbers of 20- 
1000, they weaken the bacterial cell from within, causing it 
to burst and liberate the infective particles (Calendar, 
198 8) . 

The bacteriophages exert such a wide range of effects 
upon, their hosts that even among phage- species using the same 
receptor there may be several different consequences of 
infection. Moreover, the effects of a phage upon its host can 
be manipulated. Mutagenesis followed by artificial selection 
for phage virulence or benevolence can intensify or mitigate a 
phage's deleterious effect on its host. Insertion or deletion 
of phage genes, including insertion of genes that confer 
met a bo. I i c capabi 1 i t i es o r ant ibiot i c r es i stance upon in f ec t ed 
bact e r in, a r e <: : o mm <: > n 1 y p ract i c e d nu * a r i s o f in a ri i p u 1 a ting t h e 
impact, that phage have upon- t ii-i i b. t e r i a 1 hosts. Anothor way 
t.u mudLfy a phage's effect., upon its host is t < * alter, the 
hos*. . For example, certain groEL mutations in E . coll produce 
bact-r. r;i.l straw::.: thd .if sus^ept l b i <■ to T4 , '! iambdri, and 
186 phage infection, but do not permit the intracellular 
assembly of mature phage part rcles ( Z e i 1 e s t r a - kyu 1 1 s et al . , 
1^0 <). Thus, infected baeteira are retarded, -m even killed., 
bul t h^y do n-it y i e 1 < i i n : er t o. n ;s phni*- paitieles; in short, 
t hf\s*' Ivici oi u rire a dead-end toi t hr phacj" thai liilect them. 
Finally, t he effects ol phage upon their hosts can be nio-li t lod 
by the preparation of phage ghosts (phage depleted oi DNA), by 
methods in the liteiatur- (Douiangei et al., 19HB). 

}',«•• ji;. • Mm-V d-. ! :--t ^ it ! V i ■ 1 ► • 1 1 ' - lie ■ , ph-i'l" nhos's ■ !■ ■■ n< d Hi! - 



WO 99/30151 



PCT/EP98/07922 



- 26 - 

channels in linearly related to the number of gtrosts that bind 
to the cell; gradually, the efflux depolarizes the cell, which 

eventually dies . 

Toxins and viruses of eukaryotic cells recognize specific 
adhesion domains of their receptors on eukaryotic' cell- 
surfaces. The receptors of at least thirty different species 
of virus of eukaryotic cell lines, representing eleven 
different vira] families, are described in the literature, and 
for most of these pathogens, the viral adhesion domains, which 
may comprise oligosaccharides, glycosylated peptide sequences, 
or sequences unadorned by sugars, have been characterized 
(Tyler el di . , 1996). The receptors of eukaryotic pathogenic 
agents, like those of bactei locins and bacteriophage, are 
commonly cloned onto vectors and expressed from inducible 
promoters, allowing manipulation of the number of receptors . 
expressed per cell (Staunton et al . , 1990). 

Major group human rhinovi ruses (HRVs) provide an example 
of the pathogenic agents that affect eukaryotic cells and are 
suitable for use in this invention. HRVs are members of 
Picornaviridac, small protei n- oncaps ida ted viruses with RNA 
genomes. Of 100 serotypes of rhinovirus, 89 use an extremely 
well-characterized glycoprotein, 3CAM-1 (CD-54) as their 
receptor. HRV serotype 14 is well studied , easily propagated 
in the lab, and therefore suitable for use in this invention 
(Staunton et al., 1990, Couch, 1996). Major group HRVs grow 
efficiently in many different lines of human cell culture. The 
most commonly used tissues are WI-38 diploid fibroblast, fetal 
tonsil, MRC, HeLa and COS cell lines. Optimal conditions and 
protocols for growth are described in the literature (Couch, 
1996). HRVs form plaques on monolayers of a variety of cell 
lines exposed by any one of the described techniques for 
exposure to viral agents. Infected cells yield 10 to 200 
plaque forming units per cell after 10 to 12 hours of 
incubation (Couch, 1996) . 

The structure oi ni^ v j i no.> wv. ~j -— j 
crystallography (Staunton et al., 1990). The capsid is 30 nm 
in diameter, exhibiting icosahedral symmetry and a canyon, 12- 
30 A wide and 2 9 A deep, that runs along the 5-fold axis and 
contains the pathogen's receptor binding site. The receptor i.« 
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the wel 1 -character i zed member of the immunoglobulin 
super/family, ICAM-1 (CD-S4), a 532 residue, 19nm x 2-3 nm rod 
comprising h immunoglobul i n-like domains arranged in a line 
with a hinge between domains 2 and 3 (figure 4). Viral binding 
involves insert: ion of the distal immunoglobulin domain (Dl) 
into the viral canyon. This mode of interaction is extremely 
vulnerable to disturbance by steric inhibition (Staunton et 
al . , 19 c >0). Thus, although the lower three immunoglobul in - 1 i ke 
domains { I H - P 1 o are not directly involved in viral binding, 
and mutations within them do not disturb adhesion, their 
complete deletion does inhibit adhesion by reducing 
accessibility ot Dl . 

ICAM-1 bear. 4 : multiple N-l inked oligosaccharides; and 
variation in g 1 yoosy 1 a t ion between cell -Types Jesuits in I C AM- 
I'S molecular mass heterogeneity of 76 to 11-1 kL). It functions 
as the cell-surface ligand for lymphocyte function-associated 
antigen 1 (LFA-1), and binding between ICAM-1 and LFA-1 
contributes to leukocyte adhesion in several immunological and 
inflammatory processes (Staunton et al . , 1990). Cytokines 
secreted by T- 1 ymphocytes and monocytes, including IFN-g, TNF- 
a and b, and and b, induce? expression of 3.o x 10^ copies 

per ondot. ho 1 l <i 1 cell. Howovei, miindut'ed colls (exhibit much 
i')wei irvtd;, u! i (.'AM- .1 , and in tin-- CbM*i expression system, 
ICAM-1 can be exogenously regulated through control of an 
i ndiic i b i < ■ p r i iiiu »t » - 1 (.'if rii]i)t (in * a :i i . / i eeo» wh i ch makes this 
system excel lentely suitable tor the 1 unction of this 
invent i on . 

Is ,i p r * - t ♦ * i r ' 1 : i embodiment of the pr osent: invention/ t he 
rm >t hud is . • i r r n ■< i <m p ; i . f h<- t w m ■ t i hi oh t h i o;i ( jhf ui t : < ■ i * i 1 ! n< 
ti ss. 1 y wiiii »'•]!: s i os < - 1 I t >< m -u ! , : t i ( >]\ i : .'vj „ t(i kl multiplicity 

of triid'd '-onipounfis t r ( -m wise!; ,m unknown bindmu partnei lor d 
polypeptide of interest is. to be identified and isolated. The 
t a r c j o t compound in this embodiment noy be (i single definite 
* '( nit ounJ, .j mixture e i o- :n i p'. .'in ids < , t ■ . have ,j 1 , i : . ;» • va i :o y • : 
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genomic information is available, such as Mycobacterium 
tuberculosis or Helicobacter. 

The method according to the present invention is usually 
carried out by performing steps a) to f) as described above. It 
may be necessary to repeat the selection process, e.g., with an 
.ncrease in the selection pressure. This is suitable if, e.g., 
the number of isolated, positive clones is too large to handle. 

in such cases, it is preferred to repeat steps b) through 

. ■. . . an i nrrPRSP in the 

e) at least once, preieiauiy ^m^*. — * — - - 

selection pressure in the repeated steps. 

The method can be carried out by steps b) through e) until 
the cell population comprises a desired number of distinct 

clones . . 

The term "background" is used to refer to cells that fail 

to bind the target compound but nonetheless remain in the 

selected population. In general, it is desireable to minimize 

the number of background cells, so that CISTEM reliably 

eliminates cells whose imprinted domains fail to bind the target 

compound, while selecting those cells whose imprinted domains do 

bind the target. In one strategy for the reduction of 

background, the CI STEM cell population is supplemented with 

-helper cells", whose function is propagation and amplification 

of the selective agent. This amplification of the selective 

agent increases the intensity of selection, and thereby reduces 

residual background cells. The helper cells lack a selectable 

marker that is present in the rest of the CISTEM cell 

population; this permits efficient elimination of the helper 

cells after they have served their purpose of amplifying the 

selective agent. 

Supplemental cells ,nay also be used in other ways to regulate 
the action of the selective agent. For example, to stop the 
action of the agent, heat-killed cells bearing the agent's 
receptor may be added to the CISTEM cell population. These cells 
bind the agent, removing them from selective activity m the 
remaining CISTEM cell population. 

When repeating the steps, new sequence variations can be 
introduced in the variable regions of clones selected an step d) 
before steps b) to e) are repeated. 
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The screening test may also be ( addi t ional-Jry ) performed by 
functional parameters. In a specific embodiment of the present 
invention, the polypeptide of interest binds to the target 
compound and affects its function. Then, the polypeptides 
exhibited by such clones selected in step d) and amplified in 
step e) are screened by their ability to affect the function of 
the target compound. 

An additional step of screening is possible, if the 
polypeptide of interest exhibits an enzymatic activity. In such 
a case, the method according to the present invention comprises 
a step wherein the? cell population is placed in a nutrient 
medium in which each cell's supply of a vital compound depends 
upon that cell's variable region acting enzyma t ica 1 1 y upon a 
compound structurally similar, but not. identical to the target 
compound. 

This invention provides a method for identifying and 
isolating a polypeptide of interest or a target compound or a 
gene of said polypeptide or target, using the detection of the 
interaction between a polypeptide of interest and a target 
compounrl . In this interaction, at least: one of the 
interactants is unknown. Tlu- 1 method allows for identification 
of the unknown i nt • > r act i on partner, i.e. -i polypeptide of 
into res* und/o: t h • • * a i q e * compound. Ke- : some embodiments, 
both interactants may be unknown, e.g. when an unknown ligand, 
upon bin i l nq -a * ,.i \ -■.'> ;.• \ ;it» , - - >: e : t s ii e.e:t.<iiii el. iect on 

the cell which needs to be ant. agon i. zed or interfered with and 
both the ligand and the receptor domain to which it binds need 
To be identified. ( The worn: "identify" and " ident i I ieat ion" 
,ne us«»d her*o as in th<- r'-.'-l ef t h" sj •* ■ «■ - 1 t j e« i t ion, te denote 
both the di s« -t)Ve t y of f -o 1 Vp*d' t i do s ol inteios! -hut exist in 
nature, and tin- dosjqn o! polypept ides, th.it have no natural 
ocou i onro . ) Knibod i nuMi t s <d this met h cm J ale ca 1 1 od '■ : T CTEMs , an 
a c r en y m tor < ' 7 * el 1 u 1 a I I mp i i nt l n q by ' > e 1 < • c t ion: T « * 1 e - > 1 o q l c a 1 
Kv-. > 1 u t o . n ■ i-li'li' \i b . . A ■'!.'; TEM npf d ■ . • . ; . t i . - t . v i ? : • ■ . 
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comprising proteins, glycoproteins, proteoglycans, etc., 
presented in a wide range of sizes, on a diverse array of 
underlying structures, and by any one of several prokaryotic 
or eukaryotic cell cultures (figure 1). 

The terms .interaction" and .interact" for the purpose of 
the present invention denote any action of the -target compound 
upon the polypeptide of interest (in the extra- or 
intracellular domain of the membrane protein) and that alters 

, ni*- 0 cnfh that the selection agent is 
or occiuaes tue ^ <= -l <= ^ ^ w • 

unable to utilize said selection site. 

In the specific embodiment wherein the selection site is 
located on an adhesion domain, the binding of the target 
compound prevents the action of the selection agent via 
intramolecular interaction (in the membrane polypeptide), e.g. 
by occluding the .election agent from its adhesion domain. 
This may comprise binding, adhesion, structural changes or 
chemical modifications of the variable domain which has an 
impact on the adhesion domain. Such impact on the adhesion 
domain may, inter alia, be a conformational change of the % 
adhesion domain that prevents the harmful action of the , 

selection agent. ■ _ 

The invention is useful in at least three general areas. 
First, the invention may be employed to identify polypeptides 
with properties that are useful in medical diagnostics, 
therapeutics, research, laboratory procedures, or other 
applications. Among the polypeptides that may be identified by 
the method disclosed here are polypeptides that interact with, 
in particular bind with high affinity and specificity to, 
virtually any given target compound, polypeptides that adhere 
to their respective target compounds at multiple epitopes, 
polypeptides that bind their respective target compounds but 
show no affinity for compounds closely related to the targets, 
enzymatic polypeptides, and polypeptides that show biological 
activities, such as antagonism or agonism of a receptor, 
seeond. bv identifying polypeptide sequences that interact 
specifically with a target that is a known compound of 
significant interest, this invention may assist in the 
identification of the know.) compound's native interactants . 
Third thrs invention may be used to analyse the interaction 
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between a known polypeptide of interest and a known target 
molecule. By identifying which sequences in the polypeptide of 
interest interact with the target molecule, the invention 
assists in mapping the polypeptide epitopes that play a role 
in the interaction. In a further embodiment, the method of the 
invention may be used to screen a population of. target 
compounds and to identify those compounds that bind to a known 
polypeptide. In this embodiment, cells displaying a given 
section in the extra- or intracellular domain of the membrane 
polypeptide are used in a screening assay as test cells to 
screen tor e.g. pliar maceut ical 1 y active compounds that 
interact with the polypeptide of interest. 

This invention may also be used to identify polypeptides 
with more sophisticated properties of interaction with the 
target compound than prescribed binding affinity. Among the 
molecules that may be identified by this method are the 
following: polypeptides that exhibit multivalent binding; 
polypeptides that bind tightly to one molecule but show no 
affinity for a second, closely related molecule; polypeptides 
that affect the functional activity of the target compound to 
which they bind; and enzymatic polypeptides. 

The invention is based on the interaction of three CISTEM 
component::: (1) a genetically engineered population of 
prokaryotic or eukaryotic cells; (2) a target compound; and 
M> H selection agent ;tigu:< on the cur lace ol each cell 

in the population, there are - at least - two polypeptide 
domains involved in the operation of the CISTEM: the cellular 
domain carrying eithei a given polypeptide whoso interaction 
with an unknown target compound is to be determined or, in the 
(: ' 1 '' ;f ' of <l!l unknown p-» 1 ypopl i do, the imprinted domain, and the 
selection site, e.u. the adhesion domain. The imprinted domain 
is highly variable, beeaiise the <~. I STEM cell population is 
engineered to display up to at least 1 0 1 distinct 
' ?J 1 dnpert ides a! the imp] int. -d domain. (It the population is 
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neighboring the selection site, e.g. on an adhesion domain, 
the structure to which the CISTEM' s selection agent must bind 
in order to exert its effects upon the cell. In the case of a 
pathogenic agent, this may be an organism or molecule, and it 
may be fatal to cells or merely retardative of their growth, 
but the first step in the agent's destruction or retardation 
of a cell is necessarily binding between the agent and its 
adhesion domain. The two important domains - namely, the 

expressed in 

imprinted aomam cinu tiic ca^^~^^- 

multiple copies on the surface of each cell, but they are - in 
the case of direct interaction - normally adjacent to each 
,,ther: more specifically, they must be sufficiently close to 
each other that the target compound, when interacting with the , 
imprinted domain, either extends far enough over the adhesion 
domain that adhesion of the selection agent is at least 
partially impeded, or exerts its effect on the selection site 
in a different way (e.g. by intramolecular steric alteration 
of the selection site) . 

The target compound is the molecule or set of molecules 
with which the polypeptide sequences identified by the CISTEM 
must interact in the prescribed manner in order to exhibit the 
properties or perform the functions desired of them. In other 
words, the CISTEM identifies polypeptides that interact with, 
e.g. bind to, and optionally also structurally or functionally 
alter, the target compound. A target compound can be any 
molecule, especially a protein, carbohydrate, saccharide, 
glycoprotein, proteoglycan, hormone, receptor, antigen, 
antibody, pathogen, substrate, metabolite, transition state 
analog, cofactor, inhibitor, drug, dye, nutrient, small 
molecular compound, etc., without limitation. Each CISTEM 
places only one requirement on the target compound: it must be 
effective in affecting the change in the selectron site upon 
binding to the polypeptide of interest, e.g. that it is 
sufficiently large that its attachment to the imprinted domain 
. ce „i^ in occlusion of the adjacent adhesion domain or that 
its binding affects the structure of the membrane protein such 
that the selection srte is sufficiently changed to prevent or 
to allow action of the selective agent. 
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To assemble the most basic CISTEM (exemplified for the 
embodiment wherein the unknown interactant is the polypeptide) 
and initiate selection, the following steps are performed. 
First, C I STEM component (1), the cell population, is incubated 
with C I STEM component (2), a solution or suspension of the 
target compound. During exposure of the CISTEM cell population 
to the target compound, the structural diversity of the 
imprinted polypeptides' variable regions results in a wide 
range of interactions between imprinted domains and target 
molecules. That is, the distinct structures displayed at the 
imprinted domains of separate cells exhibit characteristic on- 
rates, off-rates, and dissociation constants in their binding 
reactions with the target (figure 6) . After or during 
incubation, CISTEM component (3) is introduced: a population 
of bacterioom, bacteriophage, virus, or toxin particles is 
added to the cells and target compound. With this step, 
assembly of the basic CISTEM is completed, and selection 
commences . 

Interaction among the three CISTEM components confers a 
selective advantage upon cells whose imprinted domains bind 
the target compound (figure 7). This takes place as follows. 
As described above, if tin..' imprinted domain ( in the following, 
when describing stfuctuj.il features of the invention, the term 
^imprinted domain" is also used for the domain carrying a 
known p' > 1 yp-pt l d» ■ ol interest whose interaction with an 
unknown target compound is to be determined. Hence, the term 
^imprinted polypeptide" may also denote proteins carrying such 
known polypeptide of interest) and the se Lection site are 
immediate ieoqhbois on the eel J surface, e.g i 1 t.ho selection 
site \r on t li'- 1 adhesion domain, the interaction upon bmdinq 
may be performed in a very definite and easily assembled 
manner . Consequent .1 y / when the tarijet compound binds to a 
structure lor-.itod -.it th" imprinted domain, the compound can 
( . w , -j i 1 1 1 .' the ; a l I t .):'!■ o ! ! f : i ' l-if'Oil « i < i I n ■ :; i e>n d< mid ; n with t tit • 
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therefore selected, e.g. killed or retarded, by-the selection 
agent (figure 7). Thus, each selected clone increases in 
frequency and in number as it continues to divide, while 
unprotected clones are afflicted by the selection agent. From 
the cell population displaying a great diversity of structures 
at the imprinted domain, the CISTEM selects and..ampli f ies 
clones expressing structures that bind the target compound 
( f igure 8 ) . 

Modifications ana ewDoiduuno u..o ^a~-~ 

selection for a prescribed affinity or nature of interaction 
of the imprinted domain with the target. Multivalent binding 

Ho .piprted. as may highly specific binding, binding, at 
predetermined sites, or binding by specified structural 
motifs. Moreover, CISTEMs may be extended to select not only 
for binding, but also for functional activity; these extended 
CISTEMs may be used to create enzymes and other functionally 

active molecules. 

As stated above, the embodiments of the present invention 
are called CISTEMs, for Cellular Imprinting by Selection: 
Teleological Evolution of Molecules. In general, a CISTEM 
consists of three components. They are: (1) an engineered 
population of cells; (2) a target compound; and (3) a 
selection agent (figure 5). 

The CISTEM cell population may be prokaryotic or 
eukaryotic. In case the unknown interactant is the polypeptide 
of interest, in the CISTEM cell population, one extra- or 
intracellular domain (the "imprinted domain") of a membrane 
polypeptide (the "imprinted polypeptide") is engineered to 
exhibit highly variable peptide sequences. If the CISTEM cell 
population is eukaryotic, glycosylat ion sites within the 
imprinted domain may be adorned by variable complex 
saccharides. High sequence variability at the imprinted domain 
is achieved through the insertion of variable oligonucleotides 
at the sites in the polypeptide gene that correspond to 
translated regions in which variation is desired. These 
variable oligonucleotides may be cDNAs , random sequences, or 
any other form of oligonucleotides. Techniques for inserting a 
gene for an exogenous oligopeptide at a targeted site in an 
outer membrane protein gene were described above, and an 
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extension of these techniques, which allows t he-— i nsert ion of 

5 12 

variable oligonucleotides (at least 10 to 10 different 
sequences, instead of a single, stereotyped nucleotide 
sequence), will be described in detail below. 

As was described above, many pathogenic agents can kill 
or retard the growth of cells only if they can first bind to a 
specific domain on the cell surface. A CISTEM's selection 
agent may bind to an adhesion domain that is located 
i mme d i a t e 1 y adjacent to t h e imprinted doma in on the surface of 
the cells in the CI STEM population (figure 5). The adjacent 
extra- oi intracellular domains in most of the membrane 
proteins used in this invention are separated by 3 - 30 A. In 
the preferred embodiments of this invention, which are 
described in qreatcr detail below, this arrangement of domains 
is the result of carefully choosing an imprinted polypeptide- 
and a selection agent such that the imprinted domain is 
immediately adjacent to the adhesion domain. As was described 
above, almost every membrane polypeptide serves as the 
receptor for multiple selection agents, each of which utilizes 
a distinctive adhesion domain. However, in other embodiments 
of this invention, membrane proteins are genetically altered 
in order to construct a hybrid polypeptide in winch the 
imprinted domain is placed beside the adhesion domain. The 
t echni ques f or eng inoeri ng membrane prot e i nr. , i ncludi rig the 
lu: ion t W( .era i a : e ] >o i y; >ep* ides to toim a chimeric 

polytopic polypeptide, were described above; their specific 
applications to embodiments of this invention are described 
be 1 ow . 

The t.iiM"' compound is any moleeub- e: set <•? mult iplo, 
distinct nieb'cules, which aie s imply addea tt> the cell 
population according to the ptescn* invention or, in some 
embe d l merit .* . of this invention, nje serieied by the cells, 
t hi r -mr.(dve:;. The present mvoit juii l dent, if i«\<; and selects (•♦ells, 
Mi w h i • • ' ; i ■ } . < i i s| i a v • ■ d e • i y ; ■ • c ' id" s » • < i '. i " J 1 • ■ < • 1 n t « ■ i i s t s w i t t . t h • ■ 
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this invention to the analysis of molecular 

target compound may be a known compound of Merest that xs 
thought to have important interactions with unknown 
polypeptides. Identif icatxon of these unknown polypeptides is 
assisted by this invention's identification of sequences wxth 
Tich the target compound xnteracts . In 

1 i« a molecule of interest (not necessarily a 
target compound is a moxecuxe interaction 
polypeptide), which is known to be involved m an interaction 
P yF ^ . , _* + rv identifying polypeptide 

with a polypeptide ox x.^e^ 

. f . rflpt this invention assists in 

sequences that bind the target, .ni- 

delineating the regions of the known polypeptide that are 
a! i^Mv involved an its interaction wxth the target compound. 
L'each of the three areas of applications that have ne . 
mentioned here - xdent x f icatxon of sequences wxth useful 
functions; interaction analysis; and epitope mapping - 
specxfic target compounds are descrxbed herexnafter 

The third component of the invention is the selection 
agent The most important reguxrement of the selection agent 
his already been stated: the agent ■ s effects upon cells mus^ 

e^ent upon the agent, ability to 
site which xs modxfxed upon the bxnding event of the target 

nd Thxs selectxon site xs preferably an adhesxon domaxn 
compound. This seietnu a ff r ibutes of 

that is adjacent to the imprinted domain, other attributes 
the selection agent, e.g. an pathogenic agent, such as 
^rence and concentration, may he altered, if necessary, by 
cenu.cn laheratory procedures. These attributes and their 
adjustment have already been described. 

There are two steps in the execution o£ selector for 
polypeptide seguences that bind the target compound. Inmost 
"embodiments ot thrs Invention, these steps are per or 
seguentiafly, though they may in some cases be rmultaneous. 
First, the CISTEM cell population, component (1) of the 

incubated with a suspension or solution of the 
invention, >s incubateo wil.i „„i u tion 
target compound, component ,2, of the invent ion . The solutio 
of ^cuhation is a standard medium for cell growth, described 
Z m'ore'detaif in the passages relating to eel! lines and 
media. During incubation, the structural diversity of the 
variable regions results in a wide range of bindlnq 
interactions between imprinted domains and target molecules. 
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the distinct structures displayed at the imprintred domains of 
separate? cells exhibit characteristic on-rates, off-rates, and 
dissociation constants in their binding reactions with the 
target (figure 6). Incubation continues until the binding 
between imprinted domains and target compounds approaches 
equilibrium levels. For most embodiments of the invention, one 
hour is sufficient. In certain embodiments of the invention, 
the? period of incubation is cut short in order to favor 
polypeptides that exhibit not only a high affinity for the 
target, but also exceptionally high on-rates; these CISTEMs 
are described below. 

After incubation, the selection agent, component (3) of 
the invention, is added to the suspension of cells and target 
compound. Exposure of the cell population is performed by a 
method appropriate to the agent. For example, if the agent is 
a bacteriophage, the suspension of cells and target compound 
is exposed to classical phage infection. Alternatively, if the 
agent is a colicin, exposure of the cell population to the 
agent is performed simply by adding the colicin to the medium 
in which the cells are incubated with the target compound. 
When the selection agent in the form of a pathogenic agent is 
a reproducing organism, such as a bacteriophage, it may be 
necessary : o stop its action after a period of exposure. The 
appropriate concentration of pathogenic: agent, period of 
t-xposui and pioo<-duies lui halting t h»- -vt ion ni use agent 
are in the general knowledge of the skilled man in the art. 

Addition of the select ion agent to the suspension of 
cells and taiget compound completes assembly of tho el STEM, 
and select ion common co s . Interaction among the throe CI STEM 
components eonteis a select l vo advantage upon cells whose 
imprinted domains bind the tarqet compound: e.g. when a 
structure located at the imprinted domain binds the target 
compound, t ho compound ovei laps t he suriaee of th»- adjacent 

« i d h • S ' ■: ■ , d i I S . o ' ■ « ill: tli- d i : • t , i r 1 < ' ♦ • \ > » ' \ W »'»Mi eld] : : 1 •.. id f'Xt la 
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selection agent's attachment, protecting the celT' and allowing 
it to multiply (figure 7). Cells whose imprinted domains do 
not bind the target compound with adequate affinity remain 
unprotected, and, consequently, they are killed or retarded by 
the pathogenic (selection) agent (figure 7). 

Each cell in the CISTEM population expresses multiple 
copies of its imprinted polypeptide, and binding is a dynamic 
process balancing on^rate and off -rate. Therefore even when a 

,_^^d nolvoeotide exhibits high affinity for the 

taruer compound, some copies of the imprinted polypeptide are 
left unprotected some of the time. Consequently, every cell in 
a CISTEM population, no matter how high its affinity for the 
target compound, has some probability of being hit by a 
selection agent. Nonetheless, this probability is lower when a 
-ell'" aff'ni*"" f^r the target compound is higher. 
Furthermore, because each clone in a CISTEM is present in 
multiple copies (the C I STEM cell population is redundant) the 
probability that a single cell will be hit is also the 
fraction of that cell's clonal population that will be hit 
and the differential elimination of clonal populations is the 
very basis of selection. Thus, each selected clone increases 
in frequency and in number as it continues to divide, whiLe 
unprotected clones are destroyed. In short, the CISTEM selects 
and amplifies clones expressing structures that bind the 
target compound (figure 8). 

The effect of the target compound relevant for the 
preferred embedment of the invention, wherein the selection 
site is located on an adhesion domain is that, due to its 
si , e it extends, upon interaction with the imprinted domain, 
from the imprinted domain to the adhesion domain. This is 3 - 
30 A for many imprinted polypeptides imprinted by this 
invention. Alternatively, independent of its size, the 
interaction between the target compound and the imprinted 
domain may lead to a conformational and/or chemical change of 
the variable domain and/or the adhesion domain which prevents 
the selection agent from attachment. The target compounds, 
their source, preparation, and use of CISTEMs with particular 
applications in this specific embodiment of the invention are 
described hereinafter. While the concentration of target 
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compound in a C1STEM is also important to its .function, this 
concentration depends on the nature of other CISTEM 
components, and is therefore discussed below. 

In CI STEMs designed to imprint polypeptides to serve as 
highly specific markers or partitioning agents in diagnostic 
assays, the target compound is simply the molecule that is to 
be tagged and/or partitioned by the polypeptide. For example, 
in an embodiment of the invention that is used in the design 
ot a diagnostic polypeptide that binds tightly and 
specifically to Human Immunodeficiency Virus (HIV), the target 
compound is HIV gpl20, the extracellular subunit ot a major 
HIV glycoprotein (Fantini et al., 1997). The production of 
recombinant gp!20 in a soluble form that is suitable for use 
in CI STEMs is described in the literature (Fantini et al., 
1997) . Another embodiment of the invention is used to design a 
diagnostic polypeptide for the direct detection of a different 
virus, Human T-eell Leukemia Virus type 1 (HTLV-1); here the 
target compound is the HTLV-1 surface glycoprotein gp46. The 
production of authentic/ glycosylated HTLV-1 gp4 6 in a 
recombinant vaccinia virus/T7 polymerase system is described 
in the lit e r a t u r e ( Ar p e t a 1 . , 19 9 6 ) . Diagnosti c a s s ays for 
de t i no human viral infection often bind not to the virus 
itsf-1:, but rathe? to the a n t. i h< >d i e- raised against thf 
infectious agent . One preferred embodiment of this invention 
imprints a p<- 1 ype r * : t . >:hih: : high al : inrt y : ■.. : <: 
monoclonal antibody, MAbT 7 . The target compound in this 
CISTEM, MAbT 7 itself, is commercially available (Novagene). 

The invention is also used r-o imprint polypeptide's to 
servo as an t au; >n l s t s , agonists, or modulators oJ the 
! u 1 1 • " t ; on.t 1 act ion o I b i o J o»g j ca 1 1 y aot. l ve rn< >\ ecu 1 es . When this 
invent ion is used to imp] int a polypep; ide to serve as an 
antagonist , the t a r got- compound in -ay be either o! the molecules 
involved in the int exact ion to be antagonized. HIV up 1/0 is. 
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serve as an antagonist of the interaction between the viral 
glycoprotein and its host cell receptor. 

The polypeptides selected by a CISTEM are more likely to 
functionally antagonize a given molecular interaction when the 
ClSTEM's target compound is pared down, so far as is possible, 
to one interactant's active site. Structures on either 
interactant that are distinct and topologi cally distant from 
the active site are less promising as target compounds because 

. ■ , A . V _ ^ - o>.^k ctmrfijrps ma v not interfere 
a poiypeptiae uiiau j^**^- 

with the normal interaction of the active molecules. For 
example, one embodiment of this invention imprints a 
polypeptide ( E . coli OmpA, which is depicted in figure 13., and 
described further below) to function as an antagonist of the 
interaction between the eukaryotic ceir-s i gnal 1 ing proteins 
Ras and Raf (Block et al . , 1996). (Activation of Raf, a 
protein kinase, by the immediately upstream protein, Ras, 
initiates pathways leading to the phosphorylation of 
transcription factors, the stimulation of numerous genes, and, 
ultimately, to cell proliferation and differentiation. 
Constitutively active oncogenic mutants of Ras or Raf occur in 
a large number of human tumors (Block et al . , 1996).) In -this 
CISTEM, the target compound is the domain of Raf ( amino /acids 
51-131) that is known to be directly involved in binding to 
Ras (Rafs so-called Ras-binding domain, or Raf-RBD) (Block et 
al., 1996). The construction of a recombinant expression 
vector for Raf-RBD, expression in E. coli, and purification of 
the protein are described in the literature (Block et al . , 
1996) . Using Raf-RBD, rather than Raf in its entirety, as the 
target compound ensures that the polypeptides selected by the 
CISTEM bind to precisely that domain of Raf which interacts 
with Ras, rather than to Raf epitopes that are extraneous to 
the important cell-signalling interaction. 

When this invention is used to imprint polypeptides to 
serve as agonists, rather than antagonists, of a receptor or 
bioloaical pathway, the target compound is the receptor or 
biomolecule that the imprinted polypeptide is intended to 
activate. For example, in an embodiment of this invention that 
imprints a polypeptide (E. coli OmpA) to mimic interleukin 6 
and function as an agonist of the interleukin 6 receptor (IL- 
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6R) , the target compound is a solubilized f orm_6T' 1 L- 6R : the 
expression of soluble human IL-6R an CHO cells, and the 
protein's purification, are described in the literature 
(Yasukawa et al., 1990). After this C15TEM selects 
polypeptides with affinity for IL-6R, a secondary screen is 
used to isolate polypeptides whose binding to the receptor 
results m transduction of the signal that is normally 
initiated by IL-6. This secondary screen is 'described in the 
passages relating to extensions of the basic CISTEM. 

When this invention is used to create agonists, the 
target compound is pared, so fax as is possible, to the active 
site of the receptor or biomolecule that is to be agonized. 
This is done because removal of extraneous epitopes from the 
target compound increases the probabi 1-i-ty that an imprinted 
polypeptide that binds the target will also be active as an 
agonist . 

In another example of this invention's application to the 
creation of agonists, C I STEM is used to identify, or to create 
mimics of, specific antigens that are present in infectious 
pathogens or cancer. These antigens or antigen mimics are then 
often used to provoke a specific immune response against the 
pathogen or tumor. In these embodiments of the invention, the 
target compound is composed of :nor o than one molecule: 
multiple monoclonal antibodies, speoi f ic immune receptors, 
such as cytotoxic T- lymphocyte receptors or MHO I molecule::, 
or polyclonal antisera derived from individuals exposed to a 
specific antigen, such as meninqieooca 1 Opa proteins, to a 
spec: i ic pathogen, such as me; i 1 ng i c ■< h •( - i , <)r { o a t umor , such, 
as melanoma. pMlyr.loii.il antiser a <n *> divided f rom animals, 
human volunteeis mi .irtii.d pat Jen's. This <u uupc >s i f e taigot is 
used as an i mpi i.is;; j on a "negdt i vr", m " i oot.pr i nt " , oi the 
antigen - and from this footprint, the CIliTHM is used a:; 
described in example 2 to identity the antigen or to create a 
ni i m i 1 ' ( > f t 1 1 < - antigen. An a n t i g r • n ■ m i m i • ■ is < ■ l l n t ? ■■ r e : ; f h* r a u s » • 
i t ■ ■ ! vi »• t e i d* t h,a i n:i a i T: t . ■= ! ♦ e M . j » \ , , , P t - \ , , a , . . ■ 
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protein Opa 5c include mAb B306, mAb A222/5b, and others 
described in the literature (Olyhoek et al . , 1991). Antibodies 
for use as the target compound in a CISTEM for identifying 
tumor antigens are derived from the polyclonal serum of a 
cancer patient, the isolation of which is widely described in 

the literature. 

in order to create catalysts of chemical reactions, such 
as ester hydrolysis, polypeptides are imprinted to bind to 
^4-,«- 0 .r^ioc^ The preparation and structure of 
these transition state analogs are described in the literature 
(e g Charbonnier et al., 1995). In one embodiment of this 
invention, a phosphonate transition-state analog (structure 
and preparation described in Charbonnier et al., 1995) is used 
as the target compound for imprinting ET coll OmpA to function 
as a catalyst of the hydrolysis of p-nitrophenyl esters. 

In other embodiments of this invention that are used to 
create polypeptides with specific enzymatic activity, the 
target compound is not a transition state analog, but rather a 
modified substrate. The target is a molecule that is 
structurally very similar, but not identical, to the substrate 
upon which the imprinted polypeptide will ultimately act . The 
reason for using a slightly modified structure, rather than 
the substrate itself, is the following: if the substrate 
itself were used as the target compound, imprinted 
polypeptides that not only bound the substrate, but also 
performed the desired enzymatic activity, would alter the 
substrate and then release it, rendering the underlying cell 
sensible to the selection agent. Thus, if the substrate itself 
were used as the target compound, the CI STEM would select 
against clones expressing polypeptides with the desired 
enzymatic activity. Therefore, the target compound must be an 
effective structural mimic of the substrate (i.e. polypeptides 
that bind the target must also be able to bind the substrate), 
but it must be invulnerable to the prescribed enzymatic 
activity (i.e. polypeptides that act enzymat i cal 1 y on the 
substrate must be unable to act on the target compound,. 

For example, in an embodiment of this invention that 
imprints E. coll OmpA to function as a sequence-specific 
protease that hydrolyzes the bond between two methionines, the 
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target compound is the oligopeptide NH2 -R 1 -R2-R3-M1=M2 -R4 -R5- 
COOH . Ml and M2 represent methionine residues; the bond 
between them, which is represented by "=" , is a 
nonhydroiyzable methy lene-gr oup bond. Normal amide bonds are 
represented by " - " . R1...S are variable residues, which are 
randomly chosen from the 20 amino acids. These residues are 1 
varied so that only imprinted polypeptides that bind to the 
consistent M1=M2 motif of the target compound are selected by 
the CI STEM. Imprinted polypeptides that show high affinity for 
a different; peptide motif bind at most only a tiny fraction of 
the target polypeptides, and are therefore left vulnerable to 
the selection agent. Several methods for the 1 synthesis of this 
oligopeptide target compound are described in the literature 
( Janda, 1 994 ) . — 

After the CI STEM has selected clones expressing imprinted 
polypeptides that bind the modified substrate, a secondary 
screen may be applied to isolate imprinted pol ypep t ides that 
not only bind the target, but also perform the desired 
enzymatic reaction. Secondary screens, including the one used 
in the C1STEM that imprints polypeptides to hydrolyze the bond 
between two methionines, are descr ibed below. 

In addition to the creation of functionally active 
polypeptides, such as. an t a g on i s * . ; t md enzymes, this 
invention 1 s app 1 i cat ions inci ude i nt e rac t ion anal ys l s (current 
f ecririi que r T<v:ew-1 ; n I'M :' i .'Ky i . , 199sj. by scr o^unu a 

diverse library ol polypeptides toi .sequences that interact 
specifically with a target compound of significant interest, 
tin ; l nvon t i en < i s : ' ' s \ s in t h> - \ d« i t i I l ■ m t i on c 1 1 t compound ' s 
n a t i v e ml ct ,irt - mt :, . >•'< u ♦ ■ >:< i n; p J * ■ , , i 1 t I 1. j u <j h tin- r; t i net u r<t 1 
i n t > 1 r 1 1 c : i : i b i - 1 w » ■ i ■ : > 1 c i . . find ii.it ■ K M T h « i s b * 1 e 1 1 w e 1 i - 
cha rac: t.e r l zed through c t ys t a L 1 ( >g r f g >h y rind mut. a t ion a 1 analysis 
(Block et ri 1 . , 1906), t hp -ictivution ot Kaf by Has is a 
complex process tli.it is. not y<a completely undci :;t ood, and 
p:-b,ib!y i ■]••.-. « * : . ; .:.»:, . ♦ h • • : m- < 1 . c u 1 i . . . Th" • ■ ■ . ;"!"KM 
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cell expressing Ras and Raf (the insertion of "fp reign 
oligonucleotides into the gene for OmpA is described below); 
Raf-RBD (see above for a more detailed description of this 
target compound and its preparation) serves as the target 
compound; and phage K3 serves as the (pathogenic) selection 
agent. The appropriate concentrations of the target compound 
and the selection agent can easily be determined by the 
skilled man in the art after reading the present specification 
and are described below in the examples of OmpA imprinting 

1 " This invention is also used for epitope mapping (Phizicky 
et al., 1995), serving to elucidate the surface of structural 
interaction between two molecules. In CISTEMs for epitope 
mapping, the target compound is one of -the two molecules whose 
interaction is to be mapped. Usually it is the interactant _ 
that is most easily prepared in soluble form that serves as 
the target compound. Thus, if one of the two interactants is a 
membrane protein, it is often the other molecule that serves 
as the target. In a CISTEM used to map the epitopes of 
interaction between Ras and Raf-RBD, Raf-RBD serves as the: 
target compound. As noted above, it is readily prepared in 
soluble form (Block et al . , 1996). Thus, one of the 
interacting proteins is dissected into multiple smaller 
domains which are displayed in the imprinted domain and 
selected by the CISTEM for their capability to bind to the 

other interactant. 

In certain embodiments of this invention, multiple 
separate molecules compose the target compound. Each molecule 
must meet the CI STEM ' s size requirement and must be present in 
the CISTEM at a concentration sufficient to confer protection 
upon clones that bind to it with high affinity. (Selection by 
C I STEMs with multiple distinct target molecules strongly 
favors binding by imprinted domains to multiple distinct 
molecules, because the effective concentration of target 
compound rises significantly with each additional compound 
that the imprinted domain is capable of binding.) When the 
target compound is a library of mol eculesand the imprinted 
domain displays variable regions, the CISTEM assists in the 
identification of interacting pairs of molecules: polypeptide. 



BNSOOCID <WO 99301 51 A 1 t > 



WO 99/30151 



PCT/EP98/07922 



- 4 5 - 

expressed in the imprinted domain are selected for their 
a f f m i t y for parti cul a r members of the library of molecules 
that composes the target compound. Those cells that express an 
imprinted polypeptide that binds to one or more molecules in 
the target compound are selected, while cells expressing an 
imprinted po 1 ypep t i de that fails to bind any component of the 
target compound are eliminated by the CISTEM's selection 
agent. In certain embodiments of this invention; which are 
described below, the target compound consists of a library of 
pol ypept ides or phage-displayed polypeptides that are secreted 
by the CI STEM cell population itself. 

In an embodiment of the invention, the target compound is 
the interaction partner that is to be identified due to its 
interaction with a given polypeptide oi — interest, e.g. a 
section of a naturally occuring cell surface receptor which is 
displayed in the extracellular domain. In this case, the 
target compound is the unknown mteractant . In this example, 
the cells presenting the receptor domain o f interest may be 
incubate*! with chemical compounds from a pool of substances . 
Surviving cells are indicators that the compound that has 
interacted with these cells is a candidate- substance for 
l n t o r a c t r o n with t h e r e c e p > t o i dom a l n . 

In a further embodiment of the invention cell:; are 
protected from the (pathogenic) selection agent, in an 
autocr in e fashion, d u e t o b i n < i i no between t h e i rr.p r in! e d dom a i r. 
and ci polypeptide target tnat in secreted by the cell itself. 
In t h i s embod i m e nt , t wo s e p a r a t e 1 i b r a r i e s of polypeptides, 
expressed from two separate libraries of gene:;, ,nr screened 
lor iiiteracti-.il with each < i h • ■ i . Tin ceil po p u i a t i c > n displays 
• Hi- 1 i b r ■ : r 7 . ■ 1 1 t h • > I 1 ; ; 1 1 ■ > • , • • >: = : • ■ t 1 y i i k ( ■ \ i n standard 
(' I .""TKM rrj ! p- >pu 1 a t i o-n . In t h : s emhu di iih ant , howevn; j , t he eel i 
populat ivon car r lea a is-) a second 1 ibrary ol genes. The.se genes 
reside in a vector that encodes a .secreted polypeptide. Such 
Vetera a r < ■ d > ■ a » e i t >« -d in the i ] t e r a t u t t - ( M» > ] ] e r ikon 1 e t a 1 . , 
! '* r ' e : . [• a • •} ] • . : . : i ■ : i ■ i i i 4 fin* o i • • . ; C e a ; d • . ; ! a ".' re 
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protects itself from the pathogenic agent and I_s "there fore 

positively selected. 

As was noted above, the term "polypeptide" relates to all 
types of peptides or proteins synthesr zeable by a host cell. 
To be imprinted by thxs inventxon, a polypeptide, must comprise 
at least one extra- or intracellular domain that accepts 
exogenous oliqopeptide insertions without disturbing 
translate or export of the polypeptxde to *he cell surface, 
n.*^ .aoacitv to accept insertions has been observed in 
numerous secreted polypeptides, which can be attached to tne 
outer membrane by a genetically engxneered anchor that xs 
described xn the literature (Sxnger, 1990; Cross, 1990), as 
well as xn each one ol tne ooze...-, bo^c.^, , 

m ammalian membrane polypeptides that ha^e been topologically 
mapped by gene fusion techniques. A number of these 
polypeptides, such as E . colx OmpA, E. coli LamB, and 
mammalian ICAM-1, their properties, and their capacxtxes to 
exhibit exogenous sequences in extracellular domaxns, are 
described in the literature. 

The membrane polypeptide may be autologous or 
heterologous for the ceil- Preferably, the membrane 
polypeptxde is heterologous. In preferred embodiments of thxs 
inventxon, all copxes of the membrane protexn gene xn a gxven 
clone must be altered xn some wayFor example, if a plasmxd- 
borne copy of the gene for the imprinted polypeptide bears a 
foreign insert at the imprinted domaxn, whereas the 
chromosomal copy of the gene remains wxldtype, then proper- 
selection by the CISTEM xs hindered, because the selection 
agent and/or the target compound can bind to the wildtype copy 
of the polypeptxde, as well as to the engineered copy. In case 
the cell contains the autologous membrane polypeptide but 
otherwise has the desxred properties for its use xn the 
present inventxon, the gene encoding the membrane protexn is 
inactivated, e.g. by deletion. Standard molecular methods are 
available to the person skilled in the art. 

Exogenous insertions at an extra- or intracellular domain 
are involved in two stages of each CISTEM. First, an 
oligopeptide tag is inserted into the imprinted domain, and 
cell populations exhibiting this tag are used in preliminary 
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trials to optimize the CISTEM. The oligopeptide t~ag, its 
insertion into the imprinted domain, and construction of cell 
populations exhibiting the tag, are all described in this 
section. Second, once preliminary trials with the tagged cell 
population have been performed, the actual CISTEW cell 
population is constructed: in this population, variable 
peptide sequences, rather than an oligopeptide tag, are 
inserted into the imprinted domain. The steps in the 
construction of C I STEM eel J populations are also described 
here . 

By conventional methods familiar to those practiced in 
the art, the gene encoding the imprinted polypeptide is cloned 
onto an appropriate vector. The vector's level of expression 
is important to C I STEM function. In mo st- CISTEM cell 
populations, roughly 1000 copies of the imprinted polypeptide 
are exhibited per cell. However, as is described below, in ' 
some cases expression is adjusted to achieve proper CISTEM 
selection. In preferred embodiments of this invention, 
regulation of the level of expression of the imprinted 
polypeptide is made possible through the use of a vector 
comprising an inducible promoter, such as the prokaryotic lac 
regulatory elements (Freudl, 1909), which are inducible with 
isopropyl-b-D- thi oqalaot opyranoside ( I F'TG ) , or the eukaryotic 
interferon promoter. The vector also contains a selectable 
marker, such as a nene tor antibiotic resistance, to 
facilitate transformation (selectable markers and their use in 
cloning vectors are extensively described in the literature, 
e.g., Brown, I 9 9 v , Freudl, 19H9). 

For prokaiyot i c prcteins, common expression plasmids are 
ns^d, and th»> i >w i yp< -oi j d< ■ : ; imprinted is pieieiied embod i ment s 
ol this invent r.m have nlrcniy been placed on suitable 
plasmids. Koi example, on \AiV7\ H (Freudi, 1989), the plaiimicJ 
USC(i particularly pretorred embodiment of this invent ion, 

■ <"o/j onipA (/•;. - •< / ? ()iit<-i membrane protein A is described) 
has t ' * * r : : ♦ ens- f s . : r ■ \ n . • i » . ' . .-••»>.' ♦ ♦ v i ... 
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to transform an amber-suppressing host, such as_T: call UH203 
(supF), expression is regulated not only by induction with 
IPTG but also by the concentration of the suppressor tRNA. 
The Plasmid also comprises a gene for ampicillin resxstance 
Similarly, on P SB1649 (Brown, 1992), E. call lamB xs regulated 
bv the inducible lac promoter on the common plasmxd pACl . For 
eukaryotic polypeptides, common viral vectors are suitable, 
and many polypeptides have been cloned into .pproprxate 
„ e „,_ c vor example, for the polytopxc mammalian glycoprotexn 
(ICAM-1) (intercellular Adhesion Molecule-1 is descrxbed 
further below), which is imprxnted in a preferred eukaryotic 
embodiment of thxs xnventxon, the CDM8 cDNA expressxon vector 

pCDl.8 may be used in conjunction wxth th c ™ ^e f- 

system (Staunton et al . , 1990). 

Th e vectors bearing the gene for the imprinted 
polypeptide are replicated and isolated. They may be 
replicated by a cell population, by PGR techniques, or by any 
other commonly employed methods described in the literature, 
in preferred embodiments of the xnventxon, 10 8 to 10 copies 
of the vector are isolated. The exact number depends on the 
number of distinct clones that wxil compose the CISTEM cell 
population, and on the technxgue used in the step descrxbed 
below, the insertxon of the varxable olxgonucleotxdes xnto the 

vectors . . 

The gene for a we 1 1-charact er x zed oligopeptide tag is 

inserted into the vector at a genetic site corresponding to 

the xmprinted domaxn of the polypeptide. Alternatively the 

oligonucleotide tag may replace a nucleotide sequence of the 

naturally occuring sequence. Replacement is accomplished by 

excising a natural coding region and by xnsertxng the 

olxgonucleotxde tag xn place thereof. The tag may be any 

peptxde sequence that xs known to bxnd wxth hxgh affxnxty and 

specificity to a ligand that xs commercially available or 

easll y produced in the laboratory by techniques described in 

the literature. One example of a commercially available 

tag/ligand pair is the T7 sequence (Met-Ala-Sez -Mel- *hr-^, - 

Gly-Gln-Gln-Met-Gly). which binds to the T7 antibody 

(available from Novagene, TB15 protocol). An example of a 

tag/lxgand pair that is descrxbed in the literature is ammo 
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acid sequence 36-41 of the eukaryotic polypeptide Ras and the 
Ras-binding domain (amino acid sequence 51-131) of Ra f (Block 
et al . , 1996) . The production and purification of Ras-RBD by 
commonly practiced techniques is described in the literature 

(Block et al., 1996). Many other tag/ligand pairs, such as 
peptides that bind to specific antibodies, are also described 
in the literature and available to those familiar with the art 

(Phizicky et al . , 1995). To insert the gene' for the 
oligopeptide tag at a genetic site encoding the imprinted 
domain, any common technique for inserting an oligonucleotide 
at a specific site in a vectoi can be used (e.g. Boyd, 1994). 
Several such techniques are referenced above, and one 
preferred procedure is described here- in more detail. 

Many membrane polypeptides have berrn topological ly 
mapped, and these maps are available in the literature. Using 
a topological map of a membrane polypeptide's sequence, two 
restriction endonuclease sites (site Rl and site R2 ) each of 
which is unique in the vector and is located in the genetic 
region coding for the imprinted domain, are identified. If two 
unique restriction sites do not already exist at the desired 
5' i t e o f i n s e r t i o n , they may b (. ■ e n q i r > e • ■ r e d u s i r i g a n y o n e of a 
number of procedures described in the literature (examples of 
unique restriction sires being ■ -na i n«: »-r ed at <i desired site of 
insertion are described in Brown, 19 92; Freudl , 1989) . The 
vert-: is • leaved it * h^ two - 1< -s i ar.,: t . d s l * es by digestion 
with restriction en domic 1 eases Rl and RZ , following the 
protocols and under the conditions specified by the suppliers 
of t)i" enzymer,. 'this yields <; ! .1 nr.ii j ;:ed vector with non- 
c * omp 1 o m r ■ n t r i r y st ieky ends (fC'l ,uid Id'./) and tin exciiO'd 
1 i , iom* :\ t . Th» ■ j ui-mi i :o i voe ■ t ■ i i . i . ,e 1 a t ed by s t and a i d 
elect i o p h o i e t. i c si :*. e 1 r a o ? juihit ion. 

An oligonucleotide template e r >mp j i s i n q t he 'gent? f o r the 
tag is synthesized using oonv* nt i one 1 t er:hi io 1 og y , such as 
ph"s:>) !■: a . Co t b / - > i f :ti : : t r v , ■ a • : n I •< ■ -b ' a ■ n. i t i in - a -nan - - 1 c i ,i i 
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i c fhp nenetic sequence encoding the 
and --tag gene-- is tne genei^ m _ 

oligopeptide tag. (It may be necessary to include one or two 
extra-oligonucleotides flanking the tag gene so that the tag 
is translated in frame with respect to the imprinted 
polypeptide.) Using the polymerase whose primer. is P, the 
oligonucleotide template is used to generate a double stranded 
oligonucleotide. This oligonucleotide is digested with 
restriction endonucleases Rl and R2 under conditions described 
„„ D r,-Mor«*. vieldinq an oligonucleotide with 

Dy Llie ciii.jii"'- ~~rt • . 

, , an H ps"? 1 This oliaonucleotide, wnicn nas 

sticky ends Rox ana r-s- . 

sticky-ends complementary to the linearized vector, is Ixgated 
into the vector by a common ligation procedure familiar to 
those practiced in the art. The result is that the vector's 
gene for the imprinted polypeptide bea« the gene for the 
oligopeptide tag at a genetic site corresponding to the 

imprinted domain. 

An alternative procedure for the construction of the tag 
aene with sticky ends RSI' and RS2' is used in some preferred 
embodiments of this invention, in this procedure, two 
complementary single stranded oligonucleotides are synthesized 
by conventional technology or acquired from a laboratory^ 
offering custom oligonucleotides. The two single stranded, 
oligonucleotides are synthesized such that, when they are^ 
annealed, they form a double-stranded oligonucleotide with 
sequence RSl'-tag gene-RS2'. This double stranded 
oligonucleotide is then ligated into the linearized vector 

described above. 

An example of this procedure is provided by a preferred 
embodiment of this invention, which involved the construction 
of an E . coU population displaying the T7 tag (figure 9, at 
loop IV of OmpA. in P EV218 (Freud!, 1989), the om P A gene has 
been engineered to bear a string of restriction endonuclease 
sites at a genetic region corresponding to extracellular 
loopIV of the polypeptide (figure 9). This plasmid was 
digested with Sad (enzyme and reaction buffer from New 

England Biolabs) and Xbal .enzyme and reaction burrer a, ^ 

Fermentas), yielding a fragment of the restriction site 
polylinker and the Unearned plasmid. The linearized plasmid 
was isolated by standard gel electrophoresis. Two single 
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stranded oligonucleotides were synthesized by" conventional 
technology to read as follows: 
Oligonucleotide 1: 

5' - CG [ ATGGCTAGCATGACTGGTGGACAGCAAATGGGT ] - 3 1 
Oligonucleotide 2: 

5 ' CTAG [ACCCATTTGCTGTCCACCAGTCATGCTAGCCAT ] CGAGCT - 3 • 

The sequence in brackets comprises the gene segment 
encoding the oligopept ide that is recogni zed- by the T7 
antibody. These two single-stranded oligonucleotides were 
annealed in MB J Fermentas Klenow buffer, forming the double 
stranded T7-tag gene segment with sticky ends complementary to 
the sticky ends on P EV218, linearized by Sacl and Xbal, as 
described in the previous paragraph. When the doubl e- s t landed 
oligonucleotide was li gated into the linearized plasmid, the 
T7 tag gene segment was in frame with the ompA gene on pEV2 1 8 . 
The new construct was designated pEV218T7. 

After the tag gene has been inserted into the vector, the 
vector is used to transform a population of cells lacking a 
chromosomal copy of the gene for the imprinted polypeptide. 
Strains lacking chromosomal genes for outer membrane 
polypeptides are described in the literature (e.g., Brown, 
1992, Freudl, 1989, and Staunton et al . , 1990), as well as 
below, in the passages relating to cell lines and media. 
Conventional methods of transformation arc used, and 
transformant s are selected by their expression of the vector- 
borne selectable market, such as a gene lor ampicillin 
resistance. In a particularly preferred embodiment of this 
invention, in which K . roll OmpA was engineered to display the 
TV polyp.. pt .id- Lag at looplV, P EV218T7 was used to transform 
/,; - r ' ,]l ::t '»<•'"' J !Oc, :n r v, om/>A, re. 'A, /ooAI'., n >::].) 

(Fr.Midl, 198'!), diror d in.) to «, common method oi 
elect r ot t ate-; i oi ma t. J on, as f ol lows . 

■■"> mi d overnight culture of UH203 (Luna broth, M"c) 
war. suspended in li c, f 1,1'. . CM:, woe cultivated .it <7 l V. with 

::h - ,k 1 ' W,! '" * '■>■• 'Ml I'll!" ! e,H-he.-j (>|. U ,„. ',, f !„• >- ! I ;. W- r • 
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three times and cells were resuspended in 1 ml _of ice-cold 10% 
glycerol. 42yl of the cell suspension was mixed with 2pl of 
DNA (pEV218T7 described above) solution (lpg dissolved in TE) 
and incubated 10 min on ice. 40ul of this suspension was 
pipetted to an electroporation cuvette (BIO-RAD ..Gene Pulser 
with Pulse Controller was used. Apparatus settings were as 
follows: 2,5 kV; 200 W; 25uF; time constant = 4,9 ms . 
immediately following electroporation, cells- were washed m 
600ul of SOC medium and transferred to a cultivation tube. 
rPlls were then allowed to recover for 1 hour at M "C in a 
shaking flask. Finally, cells were plated on LB plates with 
lOOpg ampicillin/ml . 

i t ^ rtnnoraip i hp V Fi T 1 able 

There are several wayo ^ ^ 1 ^ ^ -* — 

oligonucleotides that are inserted mto-the vector, and, 
ultimately, translated into the variable regions exhibited by 
the CISTEM cell population. To facilitate insertion into the 
vectors the variable oligonucleotides must have conserved 
sequences at their termini, just as the tag gene described in 
the previous section was given defined termini RSI and RS2. 
For example, in preferred embodiments of the invention, 
variable oligonucleotides with defined sequences at their 
termini are easily synthesized using solid-phase 
phosphoramidite chemistry (Brown, 1992): at defined terminal 
positions, only one of the four nucleotides is made available 
for addition to the strands being synthesized, whereas at 
variable positions, a mixture of multiple nucleotides is 
presented to the strands. Alternatively, end addition of 
bases, catalyzed by terminal transferase in the presence of 
nonlimiting concentrations of multiple nucleotides, can append 
a highly variable sequence to a defined terminal. Highly 
variable sequences may also he obtained by selecting fragments 
of defined length from digestions of natural cellular DNA or 

RNA preparations. 

In a CISTEM for screening the polypeptides encoded by a 
cDNA library, the variable oligonucleotides are the cDNA 
library genes. The size of the c DNA library that is screened 
by such a CISTEM is in some embodiments of this invention 
reduced by limiting the screen only to genes of interest, as 
determined by comparison of the cDNA library genes to 
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sequences cat aloqued in genomic libraries. In CI STEM for 
i dent \ f yin«j cancer antigens, cDNAs are prepared from tumor 
cells according to protocols described in the literature. 

In a r; I STEM for analyzing the interaction between a known 
polypeptide and a second known compound of interest, the 
variable oligonucleotides are sections of the gene encoding 
the known polypeptide, tor example, in a preferred embodiment 
of this invention, in which the interaction between the 
eukaryotic ce 1 1 - s i una 1 1 i rig proteins Ha:-; and Ha f is analyzed, 
the oligonucleotides inserted into pEV > 1 8 are sections of the 
s e « } u c n c e e i i « o d l r i q k a s . 

In preferred embodiment:-: of the invention, stop codons 
are excluded from variable oligonucleotide inserts in order to 
prevent truncation of tin. 1 imprinted polypeptide. This is 
a c h i e v e d by s y n t h e s i z i n g o 1 i go n u c 1 e o t i d e s t o read E) 1 ( NN P ) n D 2 , 
where Dl and 1)/ represent the defined sequences that 
facilitate insertion of oligonucleotides into vectors; n 
represents the number of triplet codons that compose the 
variable section of each oligonucleotide; N represents all 
four nucleotides, each with equal probability; and P 
represents a subset of nucleotides, each with equal 
probability. NMP encodes .ill ?n amino .n ids,, but precludes the 
(oowrrenee ■ > f . t ■ codons i e. -s >gn i /• -d b / th-_. CISTLM ceil 
population. Thus, when 1' is allowed to he G, C, or T, the stop) 

■odons TAA -mS "'-.'.A ar»' exsi;; l-.-.i 1 : : h< . > 1 i q. Jir.ir i ot i de 
inserts. It is not necessary to preclude the occurrence of all 
three stop codons in inserted oligonucleotides, because the 
Cli'TEM co 1 1 populfit ion n<s/ h«- o>nii" ):■■ d - o ; t op eodon- 

:upp r ess i n - 1 met ant : , wh i ■ "h >*.:-.■ \q - ib b < • ! ! r «i ns 1 «: 1 i nq oim- < ■ i 
mo r e i > l the st ' > \ ■ c. ;d-sa i. . Num-' : >; ; ; • supp r "s so r s t : . : l jis, are 
(]po:t ib'si in the liter, itur- (-.a. Hiewn, 1 ,: e')yq lenudl, 19M ( M. 

When t ho mipi ml pfj [m > 1 v'F>ept i i*e is, a o 1 yoopr c >t e i. r j or 
pi ■ > t e o q 1 y c a n , the inserted o 1 l 1 i o n 1 1 c 1 » . > t i d e ; nu i y include 

S • .. ' S > • IJ S ! I : • : - r ■ q : J . ; j ; ■ t i li 1 • Is k • d a i V • • . .'/ J . i t O ; S (''!;.! V » ! ; i I - , 
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variable positions, sequences coding for fixed-motifs, or 
statistical biases toward certain sequences or structures. 

The variable oligonucleotides are inserted at a 
prescribed site in the vectors constructed and amplified by 
the methods described above. Two preferred procedures for 
precisely targeted insertion are described here-one in which 
the number of oligonucleotides inserted into each vector is 
defined, and another in which the number of inserts is 

for insertinq 

oligonucleotides at specific vector sites are certainly not 
limited to the two described here. In fact, any of the gene 
fusion techniques used to insert a gene coding for a reporter 
moi ety mto the gene for a membrane protein may be used to the 
same effect as the procedures that wilT"be described. 

The site of insertion in the vector is located such that 
oligonucleotides inserted there are translated as part of the 
imprinted domain. The site of insertion may be further 
specified (i.e. within the sequence encoding the imprinted 
domain) according to structural analysis or prescriptions for 
a particular nature of interaction between the imprinted, 
domain and the target compound. To insert oligonucleotides, 
both of the procedures outlined here make use of restriction 
endonuclease sites that are unique in the vector, and are 
located at or near the prescribed site of insertion. If a 
unique restriction site does not already exist at the site of 
insertion, one may be engineered using any one of a number of 
procedures described in the literature (e.g. Boyd, 1994, Brown, 

1992 and Freudl, 1989) . 

To insert one and only one variable oligonucleotide per 
vector, two distinct restriction sites, each unique in the 
vector, must, be located such that they flank a short 
restriction fragment comprising the prescribed site of 
insertion. If the sticky ends left by cleaving the vector with 
one endonuclease (Rl ) at its unique site are RSI and RSI', 
while cleaving with the other endonuclease (R2) at its unique 
site leaves sticky ends RS2 and RS2 ' , then the variable 
oligonucleotides generated at step 3 are given defined and 
conserved sticky ends RSI and RS2 ' . The vectors are digested 
by both restriction nucleases (Rl and R2 ) , and the digestion 
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products are size-f ractionatecl to isolate the.^ectors, each of 
which has non-complementary sticky ends RSI' and RS2 . The 
population of restriction-digested vectors is combined with a 
slightly smaller population of variable oligonucleotide 
inserts ( vector s/ insert » 1.5 - 2.0), and ligation is allowed 
to occur. Because the sticky ends of the vectors are non- 
complementary to each other, each vector can neither ligate 
shut, nor ligate to other vectors. Similarly, because 
oligonucleotide inserts have non-complementary sticky ends RSI 
and RS2', they can neither self-ligate into loops, nor ligate 
to each other to lorm cone: a L a me r s of multiple 
oligonucleotides. The only ligation that can occur is the 
insertion of one variable oligonucleotide into each vector. 
After ligation has taken place, the vprtor population is size- 
fractionated to discard vectors that did not receive an 
insert. In the resultant vector population, each vector has 
one variable oligonucleotide at the prescribed site in the 
region encoding the imprinted domain. 

An example of this procedure for inserting one and only 
one variable oligonucleotide per vector is furnished by a 
preferred embodiment: of tins invention, in which a cell 
population is engineered t - > display variable oligopeptides at 
loop IV of E. roli ompA. In pKV;? \u (Freudl, 1980), r ho ompA 
gene has been engineered to bear a polyl inker rich in 
t est i i : " t. ion -nd-n ur } ea s- sites o i-i^ ti : region 

corresponding to extracellular ioopIV of the polypeptide 
(figure } A) . This plasmid is digested with Sad (enzyme and 
reaction butir-r ! rom Nov: tinnlnnd Miolabs'- rind Xb f iT (enzyme and 
react o >n buffer (rom MB I K* r m« wi t , t ; : . , under ondit ions 

1 V tdif .aippl i-i yielding iragmont oi the 
r (-it i l ct i on site po 1 y 1 i nkei rind Mie lmeaiized plasmid. The* 
linearized plasmid is isobded hy standaid gel 
e 1. ect r nphot er; l s . Single stranded o J i qoimf ] ( --.< ,t \ •]<■:■ a r e 
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polymerase, such as the Vent polymerase, are used t o generate 
double-stranded oligonucleotides from the s ingle- stranded 
templates. These double-stranded, highly variable 
oligonucleotides are then digested with Sad and Xbal, under 
conditions described by the suppliers, yielding .highly 
variable oligonucleotides with non- complementary sticky ends 
S'-CTAG and AGCT-3'. The highly-variable oligonucleotides are 
then ligated into the linearized plasmid (MM ligation buffer, 
conditions de-r-ribed by supplier); they ligate in only one 
oligomer plasmid, and in only one orientation. Conventional 
electrophoretic size fractionation is used to isolate ligated 
plasmids containing an insert. The product of this procedure 
is a population of plasmids (pEV218I) bearing, at the genetic 
site corresponding to loop IV of OmpA, --a highly variable 
exogenous sequence free of TAA and TGA stop codons . 

To construct a vector population in which each vector 
harbors between 0 and n variable oligonucleotides at a 
prescribed site, the following procedure may be used. First, a 
restriction site that is unique in the vector is located at 
the prescribed site of insertion, i.e., at a genetic site 
corresponding to the imprinted domain of the imprinted 
polypeptide. (If a unique restriction site does not already 
exist at the desired site, one may be engineered by 
conventional methods described in the literature, such as 
oligonucleotide-directed mutagenesis (e.g. Brown, 1992 and 
Freudl, 1989). If the endonuclease (R) that cleaves at this 
unique' site leaves complementary sticky ends RS and RS ' , then 
the variable oligonucleotides generated by the procedures 
described above are given defined and conserved complementary 
sticky end5 rs and RS ' . The vectors bearing the gene for the 
imprinted polypeptide are cleaved by P.. The vectors are then 
combined with an excess of variable oligonucleotide inserts. 
Several distinct ligation events can occur. First, the 
vectors, which have complementary sticky ends RS and RS 1 , can 
ligate to other vectors or self-ligate, closing without an 
oligonucleotide insert. Second, the variable oligonucleotides, 
which also have complementary sticky ends RS and RS ' can D 
self-ligate as a loop, ii) ligate into a vector with sticky 
ends RS and RS ' , iii> ligate to each other, forming a mult:- 
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oligonucleotide concatenation, which itself can_ per form 
ligations i, ii , or iii. After ligation, size fractionation 
can be used to discard all inserts that did not ligate into a 
vector and all vectors that did not receive inserts. This 
yields a population of vectors, each of which harbors between 
1 and n variable oligonucleotides at the prescribed site in 
the gene encoding the imprinted polypeptide. A specific 
example of this method of constructing a vector population in 
which each vector harbors between 0 and n variable 
oligonucleotides at a prescribed site may be found in the 
literature (the construction of such a library of vectors is 
described in detail in Brown, 1992). 

The vectors bearing variable oligonucleotide inserts are 
used to transform a population of cells— that lack a gene for 
the imprinted polypeptide. Conventional methods of 
transformation may be used. Trans formants are selected by 
their expression of the vector-borne selectable marker, such 

as a gene for antibiotic resistance. (Efficient transformation 

9 

of competent prokaryotic cells achieves up to 10 

transf ormant s/ ml.) The product of transformation is a 

population of cells, each of which harbors a distinctive' 

o 1 i g o n u c 1 e o t i d e insorti o n a t. a g o n e t i c s r t . e e o rres pond i n q t o a 

point in the imprinted domain. After transformation, the roil 

population is amplified by growth in a suitable medium. 

In a preferred embodiment of this- invention, in winch the 
CISTEM cell population is engineered to display variable 
oligopeptides at loop IV of F. roli OmpA, transformation is 
performed in the following manner. To- variable pJasmid 
population, pF.Wi>M, whoso oons t i not ion wis described above, 
is u •< \ : • ( t i . j ! . . t t * : :v. K . Ci / .< / ! : : j ; ; i I i . 1 > -' ( / a e, ruj >F, i -m/ >A , 
leu "A, proAi', /;>sh) , according to < : common method ol 
t ran r ; f o r ma t i on described in the literature (Frondl, 19^9) . The 
IJH?0/. pnpul at ion is grown in I.uiiti broth : ; upp 1 erne n t ed with 100 
mg A[ ) / m 1 , s " 1 • " * * no f ■ * r A r * r . ■ • - ! ■ o . r i ■ • ! t i a : i s f ■ - r m i n t : . . 
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clones as there were transf ormants in the previous step. Each 
clone dxsplays a distinctive oligopept.de at the imprinted 
domain. If the xmprinted polypeptide is a glycoprotein or a 
proteoglycan, gl ycosylation sites in the imprxnted domaxn bear 
diverse and unpredictable oligosaccharides. (Commonly 
practxced procedures for verxfying that an extracellular 
domaxn-in this case, the xmprxnted domain-has been 
glycosylated are descrxbed xn the literature' (Lis et al . , 

93 , In 3hcrt> the CT STEM cell population displays a vast 
combinatorial library of polypeptides. 

in a preferred embodiment, a CISTEM is constructed 
wherein the imprxnted polypeptide xs a polytopic membrane 
protein in which the domain that accepts insertions is . 
directly ad D acent to the adhesion domain- of a selection agent 
suitable for use in the CISTEM. This preferred configuration 
is depxcted xn figure 5, and several of the 

polypeptxde/seiection agent combinations that exhibit this 
conjuration are listed in table 1, and xn the literature 
(Tvler et al., 1996). When these polypeptide/pathogen 
combxnations, or others lx k e them, are used, the CISTEM is 
assembled wxth a minimum of genetxc mam pulation of the. 
imprinted polypeptide; the only necessary manipulation is 
introduction of an oligopeptide tag, as well as variable 
peptide sequences, to the imprinted domain, as described 
above. However, the list of polypeptides that may be imprinted 
is by no means limited to polytopic proteins exhibiting 
adjacent adhesion and imprinted domains. To --her expand the 
diversity of polypeptides that may be imprinted by ClSTEMs, 
two genetic manipulations are employed. 

When secreted proteins are imprinted, they are first 
converted to their glycolipid- or t r ansmembr ane-anchored 
xsoforms. Because translocation of secreted proteins and 
membrane insertion of integral proteins are handled by the 
same cellular machinery (reviewed in Singer, 1990, and above), 
many secreted proteins may be converted to membrane-bound 
xsoforms simply by carboxyl- terminal attachment or a 
hydrophobic transmembrane span or a signal sequence for 
q lycoli P1 d addition. For example, the q ene- f ragment encoding 
the 37 carboxyl terminal amino acids of decay accelerating 
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factor, a eukaryotic glycophosphol lpid-anchored protein, can 
be fused to the gem- for a secreted protein, such as human 
growth hormone, in order to target and anchor the protein to 
the apical surface of human intestinal cell lines. This 
procedure for converting a secreted protein to its glycolipid- 
anchored isoform is described in the literature (Cross, 1990) 
Conversion of secreted polypeptides to isoforms that are 
anchored by transmembrane spans is also described in the 
literature. For example, hepar in-binding epidermal growth 
factor-like growth factor (HB-EGFJ , can be expressed as a 
longer, membrane-spanning i soform (HB-EGFTMII ) comprising an 
extracellular domain that is structurally identical to the 
secreted polypeptide (Onu et al . , 1994). Other examples of and 
techniques for the conversion of secreted polypeptides into 
isoforms anchored by transmembrane spans are described in the 
literature (Singer, 1990). 

A second genetic manipulation of the imprinted 
polypeptide is performed when the domain that is to be 
imprinted lacks an adjacent adhesion domain. This occurs e.g. 
when the polypeptide chosen for imprinting is not polytopies 
or when the selection agents that adhere to domains adjacent 
to the imprinted domain are unknown or poorly suited for use 
in the CISTEM. In either situation, the sequence for a 
functional adhesion domain may be fused to the gene for the 
extra- or i nr race 1 lul a r domain, creatinu a gene for a chimeric 
polypeptide comprising both the functional adhesion domain and 
the variable domain. For example, the colicin A adhesion 
domain from hi. coJi" OmpK, when fused to separate domains from 
another outer membrane protein, OmpC, remains functional as a 
^olic t n A i p t • i (Koine] et al., l^ou). a group of 
oukaiyotic vri.il systems, winch die described in the 
literatme (Chu et fl 1 . , .199S), provide further examples of the 
insertion of functional adhesion domains, in theii application 

t • ) * Mn 1 H W j ] ro M ! t " / \ T t e ! • 1 » i ' t > r I * 1 ■ ■ » \ >...■■ . v 
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as the adhesion domain of the imprinted polypeptide. 
(Techniques for inserting exogenous sequences at targeted 
sites in extra- or intracellular domains are described and 
referenced extensively above) In this way, a functional 
adhesion domain for the virus is generated in the desired 
location, directly adjacent to the imprinted domain. 

To briefly summarize the manipulations of the imprinted 
polypeptide that have been described thus far, a secreted 
.^^ptidp mav be converted to a membrane-bound isoform, and 
afunctional adhesion domain may be inserted beside the domain 
that is to be imprinted by this invention. Owing to these 
manipulations, each of the polypeptide families shown in 
figure 1 may be imprinted by this invention. 

When this invention is used to identify the polypeptides 
that interact with a given target compound, the CISTEM' s 
imprinted polypeptide may be an outer membrane polypeptide 
suitable for imprinting. For example, in a CISTEM that 
identifies polypeptide sequences that interact with the 
eukaryotic cell-signalling polypeptide Raf, the imprinted 
polypeptide is E. coli OmpA. OmpA extracellular loop IV is 
imprinted to exhibit high affinity for Raf-RBD. The sequences 
selected for their affinity for Raf-RBD are then used to 
screen sequence libraries and cDNA libraries from mammalian 
cells with active Ras-Raf pathways. 

Similarly, when this invention is applied to epitope 
mapping, the imprinted polypeptide may be an outer membrane 
polypeptide suitable for imprinting. For example, the C I STEM 
that imprints OmpA to bind to Raf-RBD is used to map the 
surface of interaction between the cell-signalling 
polypeptides Ras and Raf. Sections of the gene encoding Ras 
are inserted into the genetic site encoding E . col, Om P A 
loopIV The C I STEM then selects clones that display sections 
of Ras that exhibit high affinity for Raf. In this way, the 
CISTEM identifies the epitopes on Ras that bind to Raf. 

Certain embodiments of this invention imprint 
polypeptides to exhibit functional activity. Among these, the 
preferred embodiments imprint polypeptides that are naturally 
well-suited to the performance of the functions desired of 
them (The structural and functional categories delineated 
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above provide an outline of the proclivities displayed by 
distinct families of outer membrane polypeptides) . For 
example, type la polypeptides are generally competent for 
functions involving high-affinity adhesion or specific binding 
to extracellular targets. They are therefore well-suited to be 
imprinted for use as highly specific markers or antagonists of 
small polypeptides, such as small peptide hormones. Type II 
polypeptides, on the other hand, are often enzymes with 
extracellular active sites, such as glycosyl transferases. 
Some type II polypeptides, then, are well suited to be 
imprinted to exhibit enzymatic activity. When this invention 
is used to imprint a polypeptide to serve as a functionally 
active effector, the preferable imprinted polypeptide is 
usually a membrane-bound isoform of a secreted protein, such 
as human growth hormone. 

Furthermore, in preferred embodiments of this invention, 
the imprinted polypeptide is a molecule that naturally 
exhibits functional or structural homology with molecules that 
perform functions similar or identical to the one desired of 
the imprinted polypeptide. For example, structural homologies 
to E . coli OmpA have been found in Haemophilus , Shigella, 
Neisseria , Bordetella , and Streptococcus . In these virulent 
bacteria, the OmpA homoloques play a role in adhesion to host, 
tissues, and are often recognized by the host's specific 
immune response. Thf st r u o f u r a J h< >:\. > ] < j y between OmpA cirri 
these antigenic adhesms makes OmpA well -suited tor imprinting 
for use in applications requiring mimicry of homologous 
udhesins. Two such app 1 i r -a t . i ons are competitive inhibition o 1 
mi c* i obi a J adhesion, rind vaccination, dose r i b o d i 1 1 urea t e r 
dot ti i ! h-](»w. 

Unlike many ether reams oi comb i na t. o r l a 1 chemistry, tins 
invention can screen libraries of polypeptides expressed by 
eukaryotio cell:;. This greatly expands the range ui molecular 
proper t i es a e r - e c s i b O > to tic;: Ot -o ■■ ; ■, t j f a , , \ ■ o >M1 y . t ■. >, .* i . ■ 
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saccharxde molecule attached to a particular site on a 

polypeptide is not a simple function of the local -no acid 

sequence. Instead, the glycosylate at a site is altered m 

complex and unpredictable ways by changes to the local 

sequence or to the sequence elsewhere in the polypeptide (Lis 

pt al 19 93) . Therefore, when the glycosylate sites within 

the imprinted domain are preserved while immediately flanking 

regions are varied by the method described above, a semi- 

. r ^ r ^„ e «uy of suaar structures is appended to a 

^^ch, of polvoeptide sequences, creating a 
systematic ai v e^. <~j - t ^ 
complex glycoprotein or proteoglycan library. 

' When a CISTEM's target compound naturally interacts with 
processed, e.g. glycosylated polypeptides (i.e., m its 
natural setting, the target compound bxnds to the processed 
forms e.g. to the glycoproteins or proteoglycans), it is 
preferred that the imprinted polypeptide be processed, 
especially glycosylated. This requires that the CISTEM cell 
population be eukaryotic. The adhesions that underpin 
inflammation, blood clotting, sperm-egg interactions, and 
intercellular recognition are all mediated by glycoproteins or 
proteoglycans. Therefore, for the identification of -lecu es 
for use as antagonists or agonists in these processes, C STEMS 
with eukaryotic cell populates and glycosylated imprinted 

domains are preferred. uar . Hnn 

The unpredictable effects of peptide sequence variation 
upon glycosylate throughout the polypeptide merit a 
cautionary note. Because changes to a peptide sequence can 
alter glycosylat ion on entirely different domains of a 
polypeptide, the CISTEM's selection (pathogenic) agent must 
not require a specific glycosylate on its adhesion domain. 
Otherwise, unexpected changes in the sugars adorning the 
adhesion domain renders some cells immune to the pathogenic 
agent even if they do not bind the target compound at their 
rmprinted domains. As was noted above, the selection site 
especially an adhesion domain of a pathogenic agent, used 
r I STEM is well-characterized. In most cases, tnis 
characterization reveals whether or not the agent requires 
glycosylates for adhesion. This may be verified, moreover, 
by remove glycosylates from the receptor and testing for 
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susceptibility to the agent; methods for removing 

glycosyl at ions are described in the literature (Lis et al . , 

1993) . 

For use as imprinted polypeptides in embodiments of this 
invention, polypeptides that exhibit structural integrity are 
preferred. It is particularly preferred that they not only 
maintain their properties when liberated from the cell 
surface, but also exhibit stability in the environmental- 
perhaps in vi vo- condi t ions in which the imprinted molecules 
are ultimately applied. Among prokaryotic polypeptides, P 
-barrels are exceptionally stable; they can be denatured only 
by boiling in strong detergents. And in eukaryotic 
polypeptides, g 1 ycosy 1 a t i on often adds structural stability to 
otherwise malleable protein structures (Lis et al . , 1993). In 
some cases, the structural integrity of imprinteci domains may 
be heightened by genetic manipulations. Such manipulations may 
be performed before or after imprinting, and they may be used 
to increase the stability of structures that will be attached 
to, or liberated from, the membrane. 

Genetic manipulations of the imprinted polypeptide may 
also be used to introduce an element of rational structural 
design to ^ CISTEM. Structural analysis of t tie target 
compound, its native li grinds, or molecules, that naturally 
perform functions similar to that desired of the imprinted 
polypeptide, may guide t tie ins^rti^i: •■ i : : ■ ii.-vi mot lis 

into th- imprinted polypeptide. A tendency toward the 
formation of certain structural motifs, such as a-heliees, may 
be incorporated into imp: inted domains by in. ci t inn variable 
oligopeptides that are not entirely random, but instead (1 ie 
statistically biased toward sequences that lonn the desired 
mrfif-. I'r • -de t t - 1 mi ried iiiutit;, may also bo inserted into the 
c o r i s e r v o d r o c j i o r i s o f i mn I j ntor] domains. F< w c ■ x a mp 1 * 1 , 
polyproline or a-holices may bo uso.i in expend a variable 
region aw, i y I" t om t } i< * < ~o 1 1 ; ■ u » f . i • - . • , o o. i i ;g : ! i:k * l • ■ a ■ " t : • : ; i t ■ I > ■ 
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• n * rhe CISTEM eel population 
In a Pr.'."- 'ff.r. 'chrolsomai copy of t h e gene f « 
consists of cells that la capable , upon transfection 

the imprinted ^.^he cop, of this ,™e, of 

with an episomal or vec ti the imprinted 

translating, exporting and « - in severa l 

polypeptide in the membrane Th».^ ^ ^ any 

preferred CISTEMs, which imprin ^ . n UH203 - (Freudl , 19 89), 

one of several targets, . _ population. 

mniDOses the umui r- , . , _ 

wnrcn is ^ ; ^ ^ ^ cQj2 LamB to D11 ,d 

Similarly, r„ .1-.- - _ 1M2)i „ hich lacKS the 

target, E. coil strai, ion . A eularyotrc example 

lamB gene, composes the ce P F CI5TEMS, which 

is provided by several P«"«~ " severe, targets. In 

imprint human " ^ thls straln lac.s XC»,1. 

these C I STEMS , COS-/ borne on the 

fr^qfpcted with an it/ui ^ y 

but is easily transfect CDM8 expre ssion system is 

CDM8 expression vector (the COS 

described in Staunton et ai . , populations are , 

The strains that compose CISTEM ce p p ^ ^ me 

preferably stop-codon ^ a t rans!atable codon. The 

of the stop-codons. such as ^ ^ recQgni2ed by the 
r eason for this » « ' popul ation must be excluded 

strain composing the CISTEM eel P P lnserC ed into the 

from the variable oligonuc eo t ^ ^ 

vector-borne gene for P , cert ain stop- 

synthesis of variable oiigonuc " ^ 
codon5 , and their Ins. rtion in o the g .» ^ o£ 

polypeptide, are described ah ^ f ^ ^ ^ CISTEM Mll 
variable oligonucleotides st op-codons. For 

population is ™ ^ ^ pre£erred 

example, E. coJa UH2 0 capable of 

— - Tu^-nl" Coventry, vanahle 
translating the UAG stop c (p£V218) , must 

oliaonucleotides inserted into the O P ^ 
b . „. o, the stop coders UAA and UGA, b t _n ^„ 

- trr --v» «^ i— — ■ 
r;: 1 ;;- ^/capable — 
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The medium in which the CISTEM cell popuLa-tion is 
propagated depends simply on the strain composing the 
population. For example, E. coli UH203 is grown in Luria broth 
or agar. When a strain contains a plasmid with an antibiotic 
resistance gene, the appropriate antibiotics are added to the 
medium or agar. Standard media for all commonly used strains 
of prokaryotic and eukaryotic cells are described in the 
literature and familiar to those practiced in the art. The 
conditions for transformation of CISTEM cells with the vector 
bearing the imprinted polypeptide also depend upon the cell 
type, and are described in more detail above, as well as in 
the literature. Induction of expression of the imprinted 
polypeptide requires incubation of the cell population with 
the compound that activates the imprinffed polypeptide's 
inducible promoter. As is described above,- however, certain 
inducible promoters, such as the lac promoter/operator 
elements, are "leaky", allowing for the appropriate level of 
expression of the imprinted polypeptide without any induction. 

When the CISTEM cell population is incubated with a 
suspension or solution of the target compound, the medium 
generally contains only the minimal requisites for cellular 
sustenance in addition to the taroot compound. The presence of 
abundant ma ■:: r omo 1 ecu 1 os other than t hr target, compound can 
result in s e 1 e c 1 i on f o r c 1 on e s who s e l mp r i n t e d doma in b i nd no t 
to the tarn- "i, but lather • ■ ■ . ■! t h»e _aai is t i t uen \ a ul the 

medium. Thus, for example, in preferred CISTEMs imprinting 
OmpA, E. coli are incubated in the presence of any one 

of several tari-t < ompounda in ATCC Medium . howeve: , in 
:;onu s CISTEMs, Ian in broth was aa^d. la several preferred 
f'lSTEMs I-ai i;r,;u a.* inj I < "AM - ■ : , < '* >S iv I I a M h: incubated with 
target compound in T< " MmimaJ Mri.lniin Kdiiic or Minima] TC 
Medium g 1 * . Minimal media f<u ether coniinonly used p r o ka r yot i c 
and on ka ryot ic cell L i n^;; ru «• dosei l bed in t hr literature. The 
conc'lil i ,it inn • ! ' a r a - a ■ - . a:a ■ ■ a a . I a . d d a i • : f o a . < C i n c u b a t 
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fh the target compound occurs undex. chemica 1 and 
rncubation with the tar, " as pQssible to those 

r^ H p - polypeptide will ultimate!* be 

: e t example, one preferred eu.aryotic CISTEM imprints 

to serve as an antagonist to „IvgpI20.s *«act. 
wrtn the receptor CD-4 (Fantini et al . , 1997).. In thrs CISTEM, 

r~» ^131^ i S rr - : - su^^rr 

solubl HIVgpiaO ' : xtrac P ellular £luid (table 

" heTpplication o f the invention to fnteractron 

lal sis or epitope mapping, the incubation occurs under 
ch-ica, and thermai conditions that are as m ar a . 
possible to those in which the studied » — 

CakeS „:;router ,=* t a„e polypeptides are ports o« adhesion 
for multiple agents that ,11 or retard the 

17 th cells they enter. Each of these agents recognizee a 
distinctive adhesfon domain on the polypeptrde receptor 

2 - »-r ^^yr^i/i/srctir^ 
r::::ziz zzi:: ^ —ly ^ rr * - - 

characterizations, a seiecciui y fifTlire 5) is 

resides besfde the imprinted domain (as shown i - ^ 

fn the event that the polypeptide that is to De 
identified In the e sele ction agent with 

imprinted is ° r che imptinted domain cannot be 

an adhesion domain adjacent to tn inserted by 

found, a functronal adhesion domain is appended or inserted 
genetic manipulations of the imprinted P o ly peptide. these 

i ^- - arP described above. 
m antpulatfo,,s with a „ adnesion domafn besfde the 

Once a selec t ion agent £ollowin g procedures 

imprinted domafn has been identified the selection 
ace performed to establish and optrmrze he agent ^ 
for cells in which the imprinted domain binds the target 
compound. This procedure serves two functions. Firs . 
establishes that the selection agent srgni f leant i, '-«•»-- 
cells in which the imprinted domain fails to bind the target 

d while leaving unaffected those cells in which the 
compound while lea J t . In other U ords, 

imprinted domain ettecuvtiy 
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the procedure demonstrates that the agent is competent to 
perform CISTEM selection. 

Second, this procedure is used to determine the 
experimental conditions that will allow reliable C I STEM 
selection. To adjust C 1 STEM selection, multiple parameters may 
be varied, including the following: concentration of the 
target compound ([T]) with which the C I STEM cell" population is 
incubated; duration of incubation with the target compound; 
initial multiplicity of infection (Mol)or concentration of the 
selection agent; duration of incubation with the selection 
agent; temperature; level of expression of the imprinted 
polypeptide. Each of these parameters may be optimized by 
varying the parameter value over a number of CISTEM trials, as 
will be described here. However, every parameter need not be 
optimized for every embodiment of this invention. In preferred 
embodiments of the invention, at least two parameters are 
optimized by multiple trials, as described here, while other 
parameters are set by common laboratory procedure or 
precedents described here. In the optimization described here, 
concentration of target compound, [T], and initial 
multiplicity of infection, Mol, are the two parameters that 
are varied over a number of trials. In the examples below, an 
optimization is described in which two different parameters - 
namely, level of expression of the imprinted polypeptide and 

duration of intubation with the selection aaent ar*- varied 

The procedures for demonstrating that the selection agent 
is competent to perform CISTEM selection, and for optimizing 
CISTEM parameters, make use of the cell population whose 
construction is described above. In this ('ell population, c: n 
o] i dopept i d>> I a.j :;, exhibited in tie Jiiifi mt e:l domain. This 
tag binds with high all nut / en i specificity to a readily 
available iigand. (The T7 t ag/T7 antibody is a preferred 
example of a tag/ligand pair; this pair and others an: 
described above arid in t h- ! l i ei . , t i j r - ) in the op. + > n i :■' . 1 1 v >» - 
ii'-.'T i bo i . v.; \ r . f : -e : • j , 1 • • r ; • ? * • d.^ 
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l6 vel of eKpress.on of the printed polypeptides" on. of the 
varied parameters. The method by which regulation of the 
imprinted polypeptide is achieved depends upon which 
heterologous promoter controls the gene for the rmprrnted 
po ypept de. For example, on pEV2 18 T,, the ompA plasmid used 
in a preferred embodiment of thrs invention, ompA « ™»« «» 
centre of the lac regulatory elements, which are rnduced by 
iCopyl-b-D-thiogalactopyranoside ,IPTG). Ih.ddit.on. the 
rsopropy ^ _^ ; „ 0T , „ om , rises a TAG stop codon corresponding 
omp« gene ; " ,„„ o£ che pol vpe pt ide ; therefore, 

to ammo aciu - i treua - ' ^ , • 

when expressed in the amber- suppress ing (supF, host E. coll 
™ 2 03, ompA expression is regulated not only by the leve of 
induct ion with IPTG, but also by the concentrator, 01 „..c 
suppressor tRNA, which is expressed from-a plasmid Low ieve 
, „ mnA i- achieved with 1.0 mM IPTG induction. On 

;z~i rider ot - 1^^,, 

— and expression is r gulated as » - 

literature (Brown, 1992) . Keguici 

nrnl o th e CDM8 vector construct used in a 
ICAM-1 from pCDl - a , tne ^-^ 1U . 
'referred eufcaryotic em.od.ment of this inventxor , xs also 
described in the literature (Staunton et al .990 . 

Two populations of cells (populations A and B) exhibiting 
the oligopeptide tag xn the xmprinted domain are grown xn a 
Medium appropriate to the cell type and the expression vector. 

P narticularly preferred embodxment of the 

ZTTco i UH transformed by P EV218T7 are grown at 
ITc T^ supplemented with lOO.g/ml ampxcxllin, and 

1 0 mM IPTG for xnductxon of P EV218T7 expressxon; conditions 
tor growth of other cell types are described above and xn the 
literature.) A thxrd population of cells (O of the same 

^t-ions A and B , transformed by the same vector 
strain as populations A ana a, 

as populatxons A and B, but wxthout the olxgopeptxde tag, 
grown xn the same medxum. (Thus, for example xn the 
particularly preferred embodiment mentxoned above, E. 
particular y p ra ther than P EV218T7, are grown 

UH203 transformed by P EV218, rather una P ^ 
in the conditions described above.) Populatxons * anu .are 
each grown to generate three clonal subpopulat ions of 
C ells! Bl, B2, B3, and Cl, C2, C3 . Population A is grown to 
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generate nine clonal subpopulations, Al, A2 . . .A3-, each 
comprising 10 () cells. 

Populations Al . . . 9 and CI ... 3 are incubated with the 
target compound, while populations B1...3 are incubated under 
identical conditions, but without the target. Optimal 
conditions and time required for binding between the 
oligopeptide tag and the target are prescribed by the supplier 
of the.- tag/target system or in the literature (Phizicky et 
al., 199 r ..). In the particularly preferred embodiment, in which 
the T7 tag is displayed at loop IV of E. col i OmpA, and the T7 
antibody (Novagen) serves as the target compound, the cell 
population expressing pEV718T7 is incubated with mAbT7 for 
approximately 6 0 minutes. The concentration of target compound 
with which cells are incubated is systematically varied across 
populations A1...9 and CI... 3 in order to identify the 
concentration that is optimal for CI STEM selection (table 3). 
Populations Al . . . 3 are each incubated with the lowest 
concentration of target compound. This minimum concentration 
varies, depending upon the properties of the agent, and the ^ 
availability of the target, but is generally approximately 10" 
x c o n c e n t rat ion of c: e 1 1 s x r i u mb e r o f i rn print e d p o 1 y p ept ides 
pel cell. Thu. c . , for example, for a population of 10^ E. coli 
grown under the conditions described above to express roughly 
1000 copies of OmpA per cell, 0.00] mg niAbT7 is used in 1 ml 
of medium. This, i :. app : c x i nm t. 1 / A x J 0 ' target molecules 
(mAbs) per imp-rinted po 1 ypopt i de . Populations A4 . . . 6 are 
incubated with a hiahei ronront rrd ion of target compound, such 
d l()x t bj e conc^nl I <it i oi. u s ♦ ■ d with populations A 1 .... 3 . 
}'-. ipi: ! a t i oris A '/ ... 4 a no; e | . . . < :i r »■ : n o ib- 1 1 <--d with t he hi ghest 
o >iifrnt i a t i on of target, . or h is 1 no v t h - ■ eon< 'on t i a t i on used 
wj t h popul at l ons Al ...->. 

fro 1 1 owi rig i i icuhd t l oi i , populations A I ... 0 , 13 .1 ... 3 , and 
C 1 . . . 3 a r o exposed t • ^ tin* ■ 1 eo t i < >n .uj'-nt . Ttic technique of 
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eukaryotic cell populations to viruses are described in the 
literature (e.g. Staunton et al . , 1990, Tyler et al., 1996). 
Like the concentration of target compound, in the optimization 
described here, the concentration of agent is varied across 
the cell populations in order to identify optimal- conditions 
for the CISTEM's selection (table 3). (As was noted above, 
however, not every parameter is varied in optimizing each 
embodiment of this invention. In the optimization described 

f taraet is not one of 

below, ror exdmpxe, 

.... r i .j tt-^*-o^h concentration is 

the parameters tnat xs vcn.u. A noteaa, 

fixed, while two other parameters are varied). Populations 
Al.4.7, Bl, and CI are exposed to the lowest concentration of 
the selection agent. This concentrations/ill vary from one 
CISTEM to another, depending upon the properties of the agent, 
the method and duration of exposure, and the availability (and 
thus the concentration) of the target compound, but it will 
generally be roughly the minimum concentration required to 
affect every cell in a fully susceptible population during a 
CISTEM run. Thus, for many lytic bacteriophages, such as . 
coliphages K3 or X, when the method of exposure is classical 
phage infection (Calendar, 1988), and the duration of 
incubation with the bacteriophage is 60 minutes at 37°C, the 
concentration will be 2 - 5 x the cell concentration, or Mol » 
2 - 5. Cell populations A2,5,8, B2, and C2 are exposed to a 
doubled concentration of the agent, and populations A3, 6, 9, 
B3, and C3 are exposed to yet another doubling of agent 
concentration. Table 3 summarizes the variation of target and 
agent concentrations over the cell populations in this 
procedure for optimizing CISTEM selection. 

The concentrations in this procedure may vary depending 
on the nature of the selection agent and the method of 
exposure, but several fundamental aspects of the procedure do 
not change from one C I STEM to another. First, in the control 
populations, B and C the selection agent has unimpeded access 

if. receptor, and therefore affects every cell- Population 
B is susceptible to the agent because, although cells bear the 
tag at the imprinted domain, the population is not exposed to 
the tag's ligand. Therefore, the selection agent's site, e.g. 
the adhesion domain, remains exposed and functional. 
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Population C is susceptible because, although tji.e cell 
population is exposed to the ligand, the ligand is not bound 
by the imprinted polypeptide, which lacks the ligand' s tag. 
Because the selection agent's adhesion domain remains exposed 
and functional in populations B and C alike, the results of 
exposing these populations to the agent are the same. Thus, in 
these control trials, exposure to the agent yields the same 
results for populations Bl and CI; likewise for populations B2 
and C2; and for populations B3 and C3. These controls confirm 
two conclusions critical to the CISTEM: 1. the presence of the 
target ligand does not affect the selection agent's activity, 
provided the ligand is not bound to the imprinted polypeptide; 
2. the presence of the tag in the imprinted domain does not 
affect the selection agent's activity, provided the tag is not 
bound to a target ligand. 

Whereas control populations B and C are entirely 
susceptible to the selection agent, populations Al . . . 9 exhibit 
a range of resistance, extending even to immunity. In the 
cells of population A, the imprinted domain harbors the 
oligopeptide tag, which binds the target ligand during 
incubation. As a result, the adjacent adhesion domain is 
occluded from the selection agent, and the cells are 
protected. Populations Al . . . 9 exhibit: a range of resistance 
because they are exposed to a range of concentrations of 
target ligand. When the range of concentrations is broad 
enough, the range of resistance extends from partial 
susceptibility to complete immunity. 

Based on rhese preliminary trials, the concentration of 
selection agent that allows for functional : I STEM .selection is 
found as i (... 1 lews . Ubi. qu i I ous pat hogein c of I ect --i.e. every 
cell in a population is al looted by the agent.--is easily 
observed when cell populations are exposeci to agents according 
to protocols described in the literature?. For example, in 
exo-cinq prMkaryotn- populations t ■ > VMub-iC lytic phage, 

1 ) \ \ 1 < \ 1 l i t 1 ! l «• , . T ► , 1 . T , , • ; , t T i * « . i . . , . . , t , . . 1 .... : i i \ 
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some cells from Bl and CI (table 3) escape the.-selection 
agent, but the cells of B2 and C2 are all affected by the 
agent, then the appropriate concentration of selection agent 
for the CISTEM is Mol =10. In this example, the minimum 
concentration required for ubiquitous effect could be more 
exactly determined by further trials with Mol ' s ranging from 5 

to 10. . 

After the CISTEM' s concentration of selection agent is 

of tarqet compound is 

lounu, tne ^P Liuid ^ 

T ft->- t ctfm has an aDDroximate 

determined as fonows. la.., -i=>. a 

threshold affinity, K a = t, such that clones whose imprinted 
domains bind the target with an affinity below this threshold 
(Ka < t) are eliminated from the population by the selection 
aqent, wheieas clones whose imprinted domains exhibit an 
affinity above the threshold (K a > t) are protected from the 
agent, and thus selected. The threshold for a CISTEM can be 
set by adjusting the concentration of the target compound. To 
set the threshold to approximately the affinity that the 
oligopeptide tag exhibited by populations A and B shows for 
its ligand, the target concentration used in the CISTEM is the 
lowest level that provides complete protection to population 
Ai in the presence of selection agent at the C I STEM ' s Mol 
(Complete immunity to the agent is easily observed when cell 
populations are exposed to agents according to common 
protocols described in the literature.) Thus, for example, if 
the CISTEM' s concentration of selection agent, determined as 
described above, is Mol = 10, and population A5 exhibits 
complete immunity to the selection agent while population A2 
is slightly reduced by the agent (table 3), then 10 molecules 
of target compound per imprinted polypeptide is used m a 
CISTEM to select for cells whose imprinted polypeptides bind 
the target with an affinity equal to or greater than the 
affinity exhibited by the oligopeptide tag for its ligand. A 
slightly lower concentration of target compound, such as 
10 3 mole cules per imprinted polypeptide, is used to select for 
cells with imprinted polypeptides that exhibit an affinity for 
the target that is significantly higher than the affinity 
exhibited by the oligopeptide tag for its ligand. A higher 
concentration of target compound, such as 10 J molecules per 
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imprinted polypeptide is used to set the CISTEHls threshold 
affinity well below the affinity exhibited by the tag for its 
1 lqand . 

The application of CISTFM to cell populations yields cell 
clones that survive the selection procedure by virtue of the 
interaction between the polypeptide of interest and the 
target (s). The sequence of the polypeptide of interest is 
encoded by the DNA sequence which has been inserted into the 
display domain. The surviving cell population can be 
monoclonal or polyclonal. In the latter instance, surviving 
cells need to be plated out on solid medium plates to recover 
colonies derived from single cells (state o f the art bacterial 
genetics). Amplification of monoclonal cell populations and 
the subsequent preparation of their relevant episomal or 
vector DNA is performed according to standard protocols of 
recombinant DNA methodology. Since the episomal DNA codes for 
the display protein and its inserted polypeptide of interest, 
DNA sequence analysis of the inserted DNA tag (state of the 
art recombinant DNA technology) provides the DNA sequence of 
t hi e insert an d t h e reby a LI o w s the prediction of the amino acid 
s e quence o 1 t h e p o 1 y p » j p t id e o i interest which lias bee n 
e x p r e s s e d a n d d i s p > 1 a y e d at t h e r. u r f a c e a s integral p art of the 
meml.tr an e piotein. The inserted DNA is der i ved from a pool of 
randomly synthesized oligonucleotides or from DNA fragments 
der i ved i r on: i t.ui a i c ; " u : nr] :uRNA !• rl'UA; v . r qc j n(jii;.i DNA. The 
selected DNA and cognate polypeptide sequences can be 
scrutinized to match sequences of known proteins and their 
coqn-it e gen- • . Thus., sequei i ■♦■ .mil y:a can be employed to 
i d « -» n * i I y t p- > 1 ypept i de 1 i n t e r " s, t and to- a \ f r ibu t. » 1 and/o j 
I. e L < i : e it. i - 1. j i n t :>: UdI; In hi »< >wi , -j* - in - •; e i I >NA s e que noes and 
the i t pr odioted prot e i ns . 

Modification:; and elaborations ol the basic CI STEM 
achieve several extensions . d it ; capab i 1 i t i es , including the 

i O i it ; W ill J : i . i U.'Ii.' '1; ' .it if: ■. ■ I : ! I < I il . i X 1 Iil U III « i I 1 1 n I 1 y - I : ■ » d t ! t » " ; 
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selection not only for affinity, but also for -Junctional 
activity; v. screening the polypeptides expressed from a cDNA 
library for binding to a target compound; vi . selection for 
imprinted polypeptides that exhibit affinity for one or more 
of a set of multiple, distinct target compounds.- 

To increase the maximum affinity attainable in a CISTEM, 
iterative CISTEM runs are performed (figure 10) . Once a 
population of clones exhibiting affinity for the target 

■» _ _j 1 v.,. -, ncTVM t-he> b^ar inn 

compound nas oeen sexo-oc^ >~-j a ^ ~ - — - , 

the gene for the imprinted polypeptide are isolated from this 
population by common techniques appropriate to the vector, and 
described in the literature. New variable oligonucleotides, 
such as those described above, are then inserted into the 
vectors alongside the original variable-insertions. This is 
performed by the same procedure that was used to achieve the 
original insertion of variable oligonucleotides at a genetic 
site corresponding to the imprinted domain. (One such 
procedure is described above) . Alternatively, variation is 
introduced not as an extension of the original oligonucleotide 
inserts, but rather as an oligomeric rearrangement of the. 
original inserts (the procedure of oligomeric rearrangement of 
oligonucleotide inserts is described in Brown, 1992) or a 
limited mutagenesis of the original inserts (techniques for 
the limited mutation of oligonucleotide inserts are described 
in Balint et al - , 1993; insertion of mutated oligonucleotides 
is performed according to the procedure described above for 
insertion of variable oligonucleotides into a targeted site in 
a vector). In other words, oligopeptide "sequence space" may 
be searched by a number of strategies, some of which have been 
applied to screening other combinatorial libraries (Balint et 
al., 1993, Brown, 1992, Janda, 1994). The vectors bearing 
fresh variation at a site corresponding to the imprinted 
domain are then used to transform a population of cells, 
according to common protocols cited above, creating a new 
CISTEM cell population. This population is then subjected to 
yet another round of selection by the CISTEM (figure 10). This 
cycle of introducing variation, selecting for affinity for the 
target compound, and allowing selected clones to multiply, is 



BNSDOCID <WO 99301S1A1 | > 



WO 99/30151 



PCT7EP98/07922 



- 75 - 

iterated until binding between the imprinted domain and the 
tarqot. compound attains the desired degree of affinity. 

The number of oligopeptide sequences screened by a CISTEM 
for affinity for the target compound is increased, i.e., 
sequence space is increased, by performing CISTEMs in 
parallel, by combining CISTEMs with hi gh- throughpu t robotics 
strategies, by performing CISTEMs on a large-scale, or by 
per tormina CISTEMs in a continuous-throughput chemostat. 

To create imprinted polypeptides that exhibit multivalent 
landing of tho target compound, multiple variable regions may 
bo imprinted by sequential C I STEM runs (figure 11). First, a 
single region or domain of the imprinted polypeptide is. 
imprinted to bind the target compound. Next, the selected 
popu 1 a t ion o f ce 1 1 s , whi ch exh i hi t the previous 1 y - i mprinted 
domain, is engineered to exhibit a variable region elsewhere 
in the imprinted pol ypept i de-ei ther in an unimprinted region 
of the previous ly- imprinted domain, or in an altogether 
separate domain. This is achieved as follows. First, the 
vectors bearing the gene for the imprinted polypeptide are, 
isolated from the selected population of cells by common 
techniques appropriate to the vector and described in the 
literatur - . N < x t , nrinn on*- of the p r o c : e d u r e s d e r : c r i b e d a 1 > o v t 1 
for th- ins*-- r *" . on < f variable ol i gonuc 1 cot ides at a specific: 
site in a gene 1 , variable sequence:; are inserted at a genetic 
: it. ! ■• ■> r r » -spond i nq to . oi ► ■ :■: t r a ■ 1 1 u i a : domain ^: t ho imp! i nt (»ti 
polypeptide. This site is distinct, however, from the site at 
which variable oligonucleotides were inserted far the first 
luinid ' :j oTKM s« ' 1 c { ' t i o: ; . The vectors, which now bear 

reic e* ei ;< > \\ n ■ n r. • s .it ( 4 U* * : : i ' i • and variable o 1 i qonuo 1 < > t i < ie s 
■ : * ano t h- : i * ' , ■ i : » a : • • • o to ? i a n s 1 o i m « i pop >u 1 a ' i < >n c < 1 ( e lis, 
cie.it.inq a new Cir.Tfc.M cell population. This population i s 
;:ubjeeted to another round o f CII'.TF.H S'eieet ion. In this round, 
the concent tat ion of target compound is lower than it was in 
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i mprl nted polypeptide that is Sprinted, a new^pTtope on the 

target compound is bound. 

Certain embodiments of this invention select for 
printed polypeptides that exhibit high on-rate in their 
Z n with the target compound, in these CISTSMs, the CI STEM 
cell population is not pre- incubated with the target compound 
prior to introduction of the selection agent. Rather, the 
selection agent is introduced simultaneously with the target 
. „. . e ^„Hnp S of aaent and target compound are 

M-\ r *~ a" 1 - a<= is described m the 

varied over multiple LLx3 ^' ~~ 

optimization procedures detailed elsewhere in this document. 
Concentrations are adjusted such that, upon simultaneous 
introduction of agent and target to the C1STEM ceil 
population, clones whose imprinted polypeptides oind to the 
target relatively rapidly are protected, whereas clones whose 
imprinted polypeptide binds to the target more slowly are 
eliminated by the selection agent. _ 
A risTEM that imprints a polypeptide to show high 
affinity for a target compound but extremely low affinity for 
a second compound, which is here called the -repelled 
compound", is constructed as follows. The size of the target 
compound, the size of the repelled compound, and the distance 
between the imprinted domain and the adhesion domain are 
altered, if necessary, to meet the following condition: when 
the target compound is bound to the imprinted domain it 
occludes the adhesion domain, whereas when the repelled 
compound is bound to the imprinted domain, it is too small 
occlude the adhesion domain. If this criterion is not met 
fortuitously (i.e. if the repelled compound does not happen to 
be extremely small), then the diameter of the target compound 
and the distance between the domains may be altered, as 
follows. The target, compound may be enlarged by creating a 
multimer. (The procedure for creating a multimer depend upon 
the nature of the target compound, but many techniques 
ioinino multiple copies of a compound are commonly practiced 
and described in the literature.. The distance between tne 
imprinted domain and the adhesion domain may be increased by 
moving either domain to a site on the imprinted polypeptide 
that is topological^ farther from the other domain. 
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Transposing t he imprinted domain simply involves inserting 
variable oligonucleotides, according to the procedure 
described above, at a genetic site corresponding to a 
different extracellular domain of the imprinted polypeptide. 
In many prokaryot 10 and eukaryotic embodiments of this 
invention, the adhesion domain may be transposed to a 
different extracellular domain of the imprinted polypeptide 
without affecting its function as a receptor for the selection 
agent. The procedure tor transposing an adhesion domain is 
described above as well as in the literature (Chu et al . , 
1995). II", cis a result of moving the adhesion domain and the 
imprinted domain farther away t rom each other, one or more 
extra- or intracellular domains, that are not involved in 
C1STEM function occur between the t wo , Topo 1 og i c 1 1 1 y separated 
domains that arc critical to CI STEM function, the superfluous 
domains may be replaced by polyproline linkers in order to 
ensure that extra- or intracellular structures do not impede 
proper occlusion of the adhesion domain by a target compound 
bound to the imprinted domain. 

Once the three critical distances - the distance between 
the imprinted domain and the idhesion domain, the diameter of 
the taraet compound, and the diaimder of the repelled compound 
- .ire i:; aoeerdanee with the ci i M n>n stated above, the 
C1STEM cell population is incubated with both the target 
compound and ■ )<> rep. ] ] ■--■d ■< mp- >i:i:d . Tliri. t he c i s T KM ' s 
selection agent is introduced, and this creates two selective 
p r e s s u r e s . On e p r e s s u r e f a vt r s c e lis expressing i mp r i n t e ci 
po 1 ypept l des wjth higher .llfinity foi the target compound 
(figure 1 ? } : . i f jni"' < '■ >m} • . »und bound to an i mp i j n! ed (iom.i j 1 1 
( 1 • i u d * ■ . t : : '. ■ ■ i o i a o ' ■ n ' . j i h • . . i • . r • d t au < i i n , and thus s e r v • - s to 
protect the imd'i 1 y urn (a 1 ] 1 i om tin- stM e ctiun .agent . Amoi i g 
eel 1 s, that exhibit some nttinity to* the target compound, a 
second select iv»- pies, sane tavoi:; cells with lower atf inity lor 
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that exhibit imprinted drains with high affinity tor the 
target compound and low affinity for the repeiled compound 

'"""To 'elect not oniy for affinity bot also for functionai 
actrvity of imprinted pol ypept rdes, a basio CISTEH may be 
couPled to a secondary screen for actrvity. !n_tne first step 
of coupled .ejection, a CISTEH selects clones whose printed 
polypeptides bind the target compound in the prescribed 

polypepc ^ ^ rl „„„. or from the imprinted 

fashion. From tnis popuia-^ - 

dome ns extracted from such a population, a secondary screen 
Elects the structures whose binding to the target compound 
"k,k... ,hc desired functional activity. Imprinted domains 
n b rrated ,r« the ~U .«f«. by «*Wr* or by expression 
a, a secreted isomorph (isomorphic forms and their 
ihterconversion are described above) may be subjected to any 
commonly practiced screens or assays for activity or 
modification of the target compound's activity. For 
one CiSTEH imprints E. coil OmpA to bind with high affinity to 
soluble JL-6-P.eceptor (This target compound is described in 
Tore detail above, . The imprinted polypeptides, freed from the 
cell membrane by a conventional gentle detergent, which ,s 
kn ow„ not to denature ompA, are then selected to . second 
screen lor their capacity to agonise the 

functional 1 1,- 6-Receptor . This screen for agonistic activity 
is described in the literature (Hibi et al . , 1990) 
Alternative^, extremely strong selection tor '™«lonal 
actrvrty may be exerted upon the CISTEM's selected set of 
cellular clones, and does not require liberation of the 
imprinted domains from the cellular membranes. In ac . 
precisely because they yield replicating cell population, 
CISTEMs can be coupled to selection regimes very similar to 
genetic complementation. In these selection regimes, each 
cell's supply of a vital compound, such as a nutrient is 
dependent upon the imprinted domain's performance 
function desired of it, such as entymatic activity. n other 
words, the imprinted domain's functional activity el her 
converts the target compound to a compound vital to the 
underlying cell, or secures the vital compound tor the cell. 
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In certain embodiments of this invention,--- the target 
compound consists of a library of separate and distinct 
molecules, which is here called „the target library". A CISTEM 
in which a target library (rather than a single target 
compound) is used serves to select interacting pairs in which 
one member of each selected pair is the oligopeptide displayed 
at the imprinted domain of a protected clone, and the other 
member is a target library molecule that binds to this 
oligopeptide and thus serves to protect the clone. Provided 
each molecule in the target library is large enough to meet 
the size requirement for CISTEM function, and is present in 
the medium at a concentration adequate to confer protection 
upon cells whose imprinted polypeptide:; bind to one or more 
molecules in the target library, the target library may 
comprise as many distinct molecules as is desired, and may be 
constructed according to any known procedure for making or 
isolating libraries of molecules. 

In CISTEMs involving target libraries, two strategies are 
used, if necessary, to increase the concentration of each 
molecule in the target library to the minimum level required 
for effective CI STEM selection, i.e., to achieve the 
concent rat ion required to confei protection from the select ion 
ag.._ic upon thos< od L; whose lmpiinteO polypept ides bind to 
o n e or mor e o f t h e mo 1 e c u 1 e s i n f 1 \ e target library. In one 
s t. r <j : e q y , the .'I CI' EM i\: 1 i population and t hr tanjet iibraiy 
are each amplified, divided into tractions, and exposed to one 
a n o t. h e r i n mi ore • titer wells. This all o w s the conce n tration of 
t <: i <jot m< ilfcuh':; t : * be r \i f 1 i c i c-n t 1 y hi gh t . < > eon 1 e i p : o t er t i on 
noon e< ■ • 11 : whe >: ■ < ■ : :np : l n t ^d p- d V] ■* p: ides bind t o on* > ; - : mo i < • 
vm • 1 eon 1 f •: . in t h»' target 1 d>i .u y. 

In the .second strategy tor mci cor-niq the concen t r a t. l or i 
o! each mo 1 ecu i * ■ in t he target library, the target, libiary is 
soci'd-d by the CJSTEM (-♦■11 popu 1 a \ i oi i itself. As, was 
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extrusion, leaving the cell in tact. Therefore, a CISTEM cell 
population that secretes its own target library can be 
engineered as follows. 

As in the basic CI STEM described above, a cell population 
that lacks a gene for the imprinted polypeptide is transformed 
with a library of exogenous copies of this gene, in which 
variable oligonucleotides have been inserted at the genetic 
site corresponding to the imprinted domain. For example, in 

* „ tt -r>7V rTqTEM cell population 
the construcLioii ~- — — - 

described above, E. coll UH203 is transformed with pEV218I, a 
library of ompA piasmids with variable oligonucleotides at the 
genetic site corresponding to extracellular loop IV of the 
membrane protein. The resultant cell population displays a 
library of exogenous oligopeptides at The imprinted domain. In 
an extension of the basic CISTEM, this cell population is then 
transformed with a second extra-chromosomal genetic element 
that encodes a secreted library of oligopeptides. The OmpA- 
imprintmg cell population, for example, is transfected with a 
library of filamentous bacteriophage engineered to display, 
exoaenous oligopeptides at gene protein 3. (Many such 
libraries are described in the literature, reviewed in Ja-nda, 
1994 and Phizicky et al . , 1995.) Each cell in the population 
now has two extra-chromosomal genetic elements: one encodes 
OmpA, in which an extracellular loop displays an exogenous 
oligopeptide; the other encodes a stably extruded M13 phage, 
in which g P 3 displays a second exogenous oligopeptide, 
distinct from the one displayed on the cell surface (figure 
14) . 

The CI STEM cell population expressing two extra 
chromosomal cenetic elements is then exposed to the CISTEM' 
selection agent, following the same protocols for exposure as 
were described for the basic CISTEM. Upon exposure to the 
selection agent, a clone in the CIST EM ' s population is 
protected only if the target library molecule that is secreted 
b « t h p -lone binds specifically to the clone's imprinted 
domain. That is, a clone is protected by an autocrine loop, in 
the specific CISTEM in which E . coir displays an OmpA library 
and secretes an M13 library, K3 serves as the selection agent 
(frgure 13). An E . coll clone is protected from K3 only if the 



BNSDOCID <WO 9930151A1 I > 



WO 99/30151 



PCT/EP98/07922 



- HI - 

exogenous polypeptide displayed at loop IV of -GmpA bands 
specifically to the MI 3 that the clone itself secretes. The 
genes for both members of this interacting pair of 
oligopeptides are easily recovered from the protected cell, in 
sum, through autocrine protection of clones, a CI STEM with a 
target library selects for specifically interacting pairs of 
oligopeptides, as well as for the genes that encode them. 

In another example of autocrine protection, a CISTEM is 
constructed according to the procedure just described, but a 
hemolysin library is used in place of the phage Ml 3 library. 
Hemolysins are secreted bacterial polypeptides that tolerate 
exogenous insertions. Thus, the hemolysin gene can be used to 
carry exogenous genes, which encode po I ypept ides that are then . 
secreted from the cell. 

Two possibilities for an intracellular C I STEM are 
depicted in figure 20, In both possibilities, the protective 
partner : partner interaction occurs within the cell. Version A 
shows a typical C I STEM, wherein the two partners are 
transported to the cell surface. In version B, the positive 
selection occurs, because the two partners cannot be 
transported to the cell surface and e.g. accumulate within the 1 
cell and /or bee; mi* substrates f o i p> < >{ < » i n dog r ada t i on 
pa t hwa y r e . 

Certain embodiments of this invention arc used to 
i dt ■ 1 1 1. i I y nut l iiens a . . s« u : \ n * -o with t minus. A '"ISTKM ice the 
identification and isolation of cancer antigens may be 
cons t: i in.: ted as f o 1 low:;: First, a e DUh library i s < i e rived f rorn 
a patient's tumor ee 1 1 s . Protocols for isol.it. ion <d cells and 
d e i ] va t i on of t h* ■ f 'I >NA 1 i \ .r a r y n » « • described in t h" 
literature. The t um< :■ r ■■ de r i ved cl >NAs s e i v- , i s t le ■ v« i r l -il > 1 e 
inserts in the construction ot t tie CJSTKM oe i ] population: 
they are ins.ojtt-d into th*- impr inted po i ypept ide vector at a 
uenetie site cor r ospond l nrj t o the imprinted domain of the 
po ] yp. - p- i de . 'it;.' ■ ' ! >HA. i! • : n : • • r t « i in ,i 1 i ( to":" r • - ,j d i nu 
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polypeptide. When expression of the vector-bor^e gene is 
induced, each clone in the CISTEM cell population displays one 
tumor-associated polypeptide, fused to the imprinted domain on 

the cell surface. 

The target compound in a CISTEM lor the identification of 
tumor antigens preferably consists of polyclonal antibodies, 
usually isolated from the same patient whose cells are used as 
the source for the cDNA library. In other words, the tumor 
cells that supply cDNA ana the antibodies that comprise the 
target compound are preferably autologous. Methods for the 
collection of cancer patient serum and the isolation of 
„,..~i~„ a i ,ni Indies, which are used as the CISTEM target 
compound, are described in the literature. 

Following incubation of the CISTEM cell population with 
the target antibodies, the selection agent is added. Any clone 
in which the imprinted domain displays a tumor-associated 
polypeptide that binds autologous antibodies has a different 
status concerning the selection agent than a cell with no such 
binding. E.g. in case of binding, these cells may be protected 
from the selection agent. By contrast, any clones in which the 
imprinted domain displays a tumor polypeptide that is not 
recognized by autologous antibodies is eliminated. Thus, this 
CISTEM identifies tumor-associated polypeptides that interact 
specifically with autologous antibodies. It is shown in the 
literature that such polypeptides are likely to be 
immunogenic, eliciting a tumor-specific humoral and cellular 

immune response. 

Certain embodiments of this invention are used to 
determine whether certain molecules interact specifically with 
an imprinted polypeptide that has not been altered by the 
insertion of variable oligopeptides. Any outer membrane 
polypeptide that is used as a receptor by a pathogenic agent, 
including polypeptides engineered by the manipulations 
described above, may be used in these embodiments of the 
invention. Because the imprinted polypeptide is not engineered 
to display highly variable oligopeptides, there is no 

imprinted domain", and the selection agent's adhesion domain 
may be located anywhere on the imprinted polypeptide. As m 
the basic CISTEM cell populations described above, the 
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imprinted polypeptide is expressed from an inducible promoter. 
The cell population, induced to express the imprinted 
polypeptide, is exposed to a target library, according to one 
of the methods described above for constructing target 
libraries and introducing them to CISTEMs. Either microtiter 
wells or the autocrine loop methods may be used, to increase 
the concentration of target library molecules. The CISTEM' s 
selection agent is then introduced according to procedures 
described for the basic C I STEM . Cells are protected only if 
molecules in the target library bind to the imprinted 
polypeptide, and thus prevent adhesion by the selection agent. 
Thus, these embodiments of this invention provide an easy and 
straightforward method for assaying a set of molecules' 
affinity for the imprinted polypeptide" 

Therefore, preferred areas, for which' the present method 
is especially suitable is the discovery and isolation of novel 
antigens for use in vaccines, tumors, novel targets for 
therapeutic intervention, novel diagnostic markers, or 
molecules useful for monitoring treatment efficacy. Such new 
products, identified, isolated and prepared by the method 
according to the present: invention is also subject matter of 
the present 1 nventi on . 

According to anotlu.i aspect, the- present, invention relates 
to the colls/the cell population to be used in the present 
Hi ven t i on w] t ii 

a) a membrane polypeptide which has the polypeptide of 

interest in its extra- or intracellular domain and 
bi ci selection s i t; e being specific for a selection agent, 
such that upon binding ot a target compound to 'he polypeptide 
oi interest the selection site r>l ! Is which exhibit such a 
nindmg r,t said target, compound to ;okI polypeptide of interest 

modi lied in a manner thcjt said so lection agent may be used 
tor selecting those colls which exhibit said binding of said 

t-i:<J"t < r . Urii j t,,, .; rilf j » ,, , ! V pt * p !. 1 ■: i < i :nt«'Zost J j r , In cells which 
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specifically for easy definable projects and standardized e.g. 
for medical purposes, such as treating cancer with immunological 
methods, treating infectious diseases resulting fro. infections 
with highly variable pathogens. 

According to another aspect, the present invention is 
erected to a method for producing a pharmaceutical preparation, 
which comprises the steps of identifying and isolating a novel 
pharmaceutical^ active compound by the CISTEM method according 

, . ^^^ion. addina pharmaceutical ly acceptable 

auxiliary components, such as buffer substances, stabilizers, 
adjuvants, etc, and completing the pharmaceutical preparation. 
The novelty of the isolated compound can easily determined by 
comparing structural data (ammo or nucleic acid sequence, 
chemical formula, NMR (or other spectroscopy data), etc.) of the 
1S oiated compound via databases with the known compounds. 

The invention is further explained by the following 
examples and the accompanying figures without, however, 
restricting it thereto. 

Fig 1 shows type I-IV polypeptides; Fig. 2 and 3 show ^ 
adhesion domains of membrane polypeptides; Fig. 4 shows an 
ICAM-1 protein; Fig . 5 shows the components of a CISTEM; Erg - 6 
shows the binding interactions between the imprinted domains 
and the target molecules; Fig. 7 shows a CISTEM selection Using 
a bacteriophage; Fig. 8 shows selection and amplification m a 
CISTEM; Fig. 9 shows OmpA plasmid P EV2 1 8 ; Fig. 10 and II show an 
interactive CISTEM run; Fig. 12 shows an imprinting example; 
Fig 11 shows the E . coli/ OmpA- CI STEM; Fig. 14 shows a CISTEM 
using bacteriophage K3; Fig. I. and 16 show the preliminary 
optimization of C I STEM selection by page K3. Fig. 17 to 1 9 and 
21 show the monitoring of the dynam.es of the C I STEM selecti- 
on; Fig. 20 shows two possibilities of intracellular CISTEM 

EXAMPLES 

i. EXAMPLE 1 : AN E . COLI OMPA/ COL I PHAGE K3/mAbT7 CISTEM 
a) E. colx Outer membrane protein A 
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E. coli Outer membrane protein A is a 325 — residue 
polypeptide comprising an ami no- termi nal , eight-stranded 0- 
bariel with four ex t r a - eel 1 u i a r domains, three periplasmic 
loops, and a roughly 150-residue periplasmic carboxyl- terminal 
moiety (figure 3)(Freudl, 1989, Morona et al . , 1985). The b- 
barrel conformation is extremely stable, and can be denatured 
only by boiling in strong detergents. Naturally present in 
approximately 2-3 x 10 copies per cell, OmpA is among the 
most abundant proteins in the E. coli outer membrane. It is 
also widely conserved in bacterial evolution. 

As was descr ibed in the specification, exogenous 
oligonucleotides, inserted into the ompA gene at sites 
coi r espond i na to extracellular domains of the polypeptide, arc 
normally translated as part of the polypeptide, arid do not 
disturb the polypeptide's insertion in the outer membrane. < 
Furthermore, the oligopeptides encoded by these exogenous 
genetic insertions are displayed on the surface of the cell, 
adopt stable folded structures, and can exhibit enzymatic, 
agonistic, or antagonistic activities. - : 

b) Construction of a cell population for testing and 
optimizing CISTEM selection 

To construct a cell population for use in testing and 
opiiiuizinu <"i.>TKM s*. • L » ""■ t luii, an r. . cc / r popu 1 a 1. 1 on lacking 
wild-type OmpA was engineered to express a pi asmi d-encoded 
copy of OmpA , in which extracellular loop IV of the 
p--* 1 ypep t i d» ■ displayed t he TV tag tn iqop^pt. i d< 1 (M>. t - Al Se r - 
M » > t -Thr - ('-. i y- ( i ' v 1 n-i i 1 n-Me t - o 1 y ) , which b i ndr; to the 

mm" rchil iy available T / an t i body ( No vauene i . In pF,V.? 1 H 
(Fi eii(]] , 19^*), t ii° ompA gene, wliidi is. under the IPTf> 
mduoible coil! to] of the lac promoter, has been engineered to 
boar a pel v linker rich in re s t r i e t ion ( m i d o n u c 1 ( > 1 1 s < 1 s 1 1 * > s at a 
. : . a , t ■ \ ■ r « ■ a , ■ ■ ■ a • ■ : < r i ^ . . t ■ o n d i r i f J t - • :■: t i , e a d ] U i a I ! a o e 1 V of t h « 
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linearized plasmid with sticky ends was isolate by standard 
gel electrophoresis. Two single stranded oligonucleotides were 
synthesized by conventional technology (custom 
oligonucleotides are available from commercial laboratories, 
and methods for synthesizing single-stranded. oligonucleotides 
are commonly use and described m the literature) to read as 
follows: 



Ol igonucleotiae 1: 
5' - CG [ATGGCTAGCATGACTGGTGGACAGCAAATGGGT] - 3' 

Oligonucleotide 2: 
b , _ CTAG [ ACCCATTTGCTGTCCACCAGTCATGCTAGCCAT 1 CGAGCT - 3' 

The sequence in brackets composesThe T7 tag gene 
sequence. These two single-stranded oligonucleotides were 
annealed in Klenow reaction buffer (MB I catalogue), forming 
the double stranded T7-tag gene with sticky ends complementary 
to the sticky ends on P EV218, linearized by Sad and Xbal as 
described in the previous paragraph. When the double-stranded 
oligonucleotide was ligated (MB I Fermentas ligation buffer) 
xnto the linearized plasmid, the T7 tag gene was inserted, in 
frame, into the om P A gene on pEV218. The new construct was 
designated pEV218T7. 

The plasmid P EV218T7 was used to transform a population 
of thP strain UH203 (lac, supF. ompA, recA, proAB , rpsL) 
(Freudl, 1989), according to common methods of transformation 
described in detail above. 

c) Preliminary optimization of CISTEM selection by coliphage 
K3 

Coliphage K3 invades E . call by binding to loops I, II, 
and III of OmpA. Loop IV is not used by the phage, and, 
indeed, exogenous oligopeptides at loopIV do not affect K3 
susceptibility of the cell (the K3 adhesion domains are shown 
in figure 3; bacteriophages are reviewed briefly in the 
specification, and bacteriophage K3, as well as its receptor 
requirements, are described in Freudl, 1989 and Morona et al., 
1983). The following procedure was performed to fulfill two 
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objectives: a) to conclusively demonstrate thatr - cells whose 
imprinted polypeptides bind the target compound are protected 
from K3 attack, whereas cells whose imprinted polypeptides 
fail to bind the target are eliminated by the phage, and b) to 
achieve preliminary optimization of the duration of incubation 
of the CISTEM cell population with phage K3 . The principle of 
this experiment is illustrated in figure 14. 

E. coli MC4100 (pEGFP-Nl), a kanamyc i n- res l s t an t , ompA 
wild-type strain, which will here be relerred to as U (for 
^Untagged")/ was cultivated overnight an Luria broth (LB) 
supplemented with 25ug kanamycin/ml . R. coli UH203 (pEV218T7), 
which will here be referred to as T (for „Tagged XN ), is an 
ampicill in- re.si s tant . ompA strain that has been transformed, 
as described in the specification, by a plasnud encoding OmpA 
bearing the T7 tag sequence at the fourth extracellular loop 
of the polypeptide. This strain was cultivated overnight in LB 
supplemented with 100 pg/ml ampici 1 1 in/ml and 5 mM IPTG for 
induction of pEV218T7 expression. Each overnight culture was 
diluted with LB to OD600 = 0.001. To create each of two mixed 
populations, 50n\l aliquots of the U culture were combined with 
c >0ul aliquots of the T culture. One mixed population was 
incubated foi 20 nun. at 37 C with 20;j] TBS alone, while the 
othei incubated under t.ht- same conditions with / Upl bB:i plus 
0.1 pg anti-T7 tag mAb (Novagene mAbT7 ) . (in table 4 , the 
pres^nr-^ abs'Tic" 1'iAbT ; \r miijated by a t oi -.; I h,nj<j 

K3 dilutee! in 1 ml LB were then introduced to each mixed 
population at a Multiplicity of Infection (Mo I) = 10. (Phage 
stock was prepared by the convent ion.il method for preparation 
of T - even ph -i ' i- ' (Calendar, 1 ( i H H } . ) Camp ! e , i 1 i qi k.j of ] Mt j pi 
wcrf' taken irein each populat ion at t:i»> appointe i t lni'v; shown 
i n table "\ , namely, immediately piocding introduction of 
l-'h'i'i" i shown i n trie table as T u, Phage ), immediately 
t o 1 b -w i n- \ l n t" i • .due t i -. .n < > \ phage (shown in the table as. T n , 
i'bi'i" ' ' , a no . . t m i n . , in m i n . , C; v.\ \ \ . . , en m i n . , f *0 mm., 
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The results of this experiment demonstrated that 
interaction between mAbT7 and the T7 tag at loopIV of OmpA 
protected cells from phage K3 . This conclusion is summarized 
most concisely in table 5. Cells exhibiting the T7 tag at 
loopIV of OmpA were protected from phage K3 infection, and 
this protection was dependent upon incubation of the cells 
with mAbT7. Cells that expressed wild- type OmpA, rather than 
the Om P A-T7 tag fusion were susceptible to phage K3, 

AbT7 . Thus, cells 

irrespective of tne present a ~~e - - 

whose imprinted polypeptides bind the target compound are 
protected from K3 attack, whereas cells whose imprinted 
' llinen HdMs fail to bind the target are eliminated by the 



±J W -A. 



phage . 

This experiment also serves in the preliminary 

» >- v—* —V, -i^o \ ^ shown in 

optimization of CISTEM selection by phage KJ . - -s 

table 4 and figure 15, in the mixed population of U and T 
cells that were incubated with mAbT7, U cells were completely 
eliminated from the population after 30 min. incubation with 
phage K3 . T cells, by contrast, maintained a roughly stable 
population well beyond 30 mm. of incubation with the phage. 
This suggested that when starting with MoT = 10, the minimum 
time period for phage K3 to eliminate all cells that failed to 
bind the target would be 30 - 60 mm. 

The initial precipitous drop in T cells followed by 
abrupt stabilization (see figure 15) suggested that the 
antibody may not yet have come to equilibrium in binding the 
T7 tag at the time the phage were introduced. Pre- incubat ion 
with the antibody was 20 mm. in duration , and complete 
protection of T cells was apparently achieved at approximately 
30 min after the introduction of phage, suggesting that the 
time required for the antibody to come to equilibrium in 
binding the tag was approximately 50 mm.. This suggestion was 
confirmed by the following preliminary optimization of the 
duration of the cell population's incubation with the target. 

e tr ,sr, c ii and T were cultivated overnight under the 
conditions described above. Each of five mixed populations of 
U and T cells was created by taking a 50pl aliquot of the U 
culture (OD600 = 0-001) and combining it with a 50ul aliquot 
of the T culture (OD600 - 0.001). As is shown in table t, and 
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figure 10, each mixed population war, incubated a~t 37 °c with 
7nM mAb'1'7 diluted in PBS for a different time period: 1 mm.; 
10 nun.; ?0 mm; 30 ruin; or 60 min. . Phage PC 3 stock, diluted 
with LB to 1 ml and Mo I - 10, was added to each mixed cell 
popuJ at i on i mmedi a t e 1 y f ol 1 owing the population' s i ncuba t ion 
with the antibody. Each population was incubated with phage 
for 30 min. Aliquots were then taken and their composition of 
U and T cells determined as described above. As is shown in 
table i>, cells exhibit inn wildtype OmpA remained unprotected 
from phage irrespective of the duration of incubation with 
mAbT 7 . By contrast, cell:, exhibiting the T7 tag at loop TV of 
OmpA received increasing protection from the antibody as the 
duration of incubation was lengthened. As was expected from 
t hi e r e s u 1 T. s f h e p r e 1 i mi n a r y o p t i m i z a t i o n of p h a q e K 3 
incubation (described above) the antibody confered complete < 
protection upon T cells only after an incubation time between 
3 0 and 6 n mi n . 

d) Initial test of CISTEM selection : 

The p r e 1 i mi na r y , s i nq 1 e-pa r a me t er opt imi zat ion s of 
incubation time with rnAbT / and ♦ he duration of K3 selection 
allowed ah initial test ci the CiSTkM's capacity to select 
rare cells whose imprinted polypopt ides bind the target from a 
1 •.. .; kq i os: i'- i • ; ;'rb.- wh^.".< i mp-: i n i * d p >. • ! y p < j p t o i ♦ 1 : • l a i i ! b l nd 
the target. To demonstrate the selection of rare- cells, the 
f o 1 1 o w i no p r o e e dure w a : p e r f o r in e d . 

K . roll stiains n (Untagged) and 'J' (Tciag(vi) are describee] 
above. II wm:; ■ "U 1 t ivated overniqh* :n IB supplemented with I'^ia 
k a n a m y c i n / ml. T was cultivated c j ve r n l q h t in LB :;upp1 em< ? ntf-d 
with liinng amp l c i 1 ! l n /m 1 and 1 mM I BTC; to induce expression of 
pEVV 1HT7, the plasmid eneedinq OmpA with the T / tag at loop IV 
(if the polypeptide. L a e i i e v e r n l c } h t cut * in e was. dilated with LB 

C ■ •' :[ >y ,!. !;; » . f : ? ■ ' . _ /. ' i i ; ; \ ■ : \ 1 ■" hi- ■ ' i I ill'' ' ' e: J ] ' U J e W- 1 S ' illiM i h» d 
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expression induced in the T strain (only ^ 

ma de this strain significantly less viable than the U St an 

r standard conditions o£ cultivation. The m xxed P°P»^ °" 

was incubated at 37c with 68nH ^bT7 .described above, diluted 

in PBS for 60 1». . Following incubation, 100 ul alrguots of 

the cell population were plated on LB plates supplemented wrth 

25 pg kanamycm/n,! or lOOpg ampicil 1 in/ml . Colonies were 

counted after overnight growth, showing that the population 

. r ...„- iBa , e iv 1 T cell for every 120 U cells 
consisreu ol apt ->r. ~ ., 

(table 7) Phage K3 stock diluted in LB was then added to the 
cell population at Mol - 2, bringing the total volume to 1 ml . 
The cells were incubated with phage for 120 »in. at 37C. 
Following incubation wrth phage, 100 pi alxquots of the ceu 
population were plated on LB plates supplemented with 2 ug 

■ , in o, n wirillm/mi. Colonies were counted 

kanamycm/ml oi lOJpg ^P 1 - 1111 

after overnight growth. 

Control experiments were performed according to the same 
protocol, except that PBS was added without mAbT7 and/or LB 
Zs added without Phage K3, as shown in table 7 The result, 
in table 7 demonstrated that rare cells (1 in > 100) 
exhibiting an imprinted domain that bound the target «re_ 
completely protected from phage K3 while all cells that failed 
to bind the target were quickly eliminated from the 

population . . 

A systematic test of the CISTEM's sensitivity in 
selecting rare cells that bind the target from a vast excess 
of cells that fail to bind the target is performed as follows. 
Overnight cultures of E. col, strains U (Untagged and T 
(Tagged) are described above. 50 ,1 - 1 -1 alxquots of the U 
and T overnight cultures are combined to create four 
populations, each representing a distinct ratio of U cell- to 
T cells- 10 8 U to 10 4 T; 10* U to 10 8 T; 10 8 U to 10 T; and 
10 9 U to 10 2 T. Each population is incubated for 00 mm. at 
37C with lOnIM mAbT7 . To monitor the population composition 
(i e the numbers of U and T cells), every 30 mm. (i.e., at 
Bo'mm. and 60 mm.) two aliguots of b pi are taken, one 
aliquot is plated on LB plates supplemented with 100pg Ap/ml, 
and the other aliquot is plated on LB supplemented with 25pg 
Km/ml The plates are then incubated overnight and colonies 
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counted to determine population composition. Immediately after 
removal of aliquots at GO nun., bacteriophage K3 (described 
above) are introduced to the cell population at Mo I =■ 2. The 
admixture is incubated at 37°C for 180 min. . Again, to monitor 
the population composition, a sample aliquot of b pi is taken 
by micropipet t e every 20 min., plated, grown, and U and T 
cells counted as described above. After incubation with phage, 
a sample of approximately 1000 cells is removed, plated, 
grown, and its composition determined as described above. 

Demonstration of the CI STEM'S capacity to select rare 
cells that bind the target compound f i um a background of at 
least. 100 times as many ce 1 1 s that fail to bind the target 
permits a final, two-parameter optimization of C1STEM 
performance. Each of five populations of~ T is induced for lb 
min. by a distinct amount of I PTG in the medium: 0 mM; 0.2 5 
mM; 0.5 mM ; 1.0 mM; or 2.0 niM . Five mixed populations of 10^ U 
and 10^T, each representing a different level of induction of 
T cells, are created by combining aliquots, as described 
above. 10 nM mAbT7 is added to each population, and the 
population.-; are incubated for 60 min. at 37°C. During the 60 
min. incubation, the population composition is monitored, as 
described abovo, by taking sample aliquots of b pi cvory 20 
mil!., plating, gr ;wing, and counting the numbei s o* u and T 
cells. After t"he removal of an aliquot at GO min., ooliphago 
s ■ i - . i : 1 t i . u n c a • c i O , i u M . . I . - r . ■ . c v » • i t lit- 1 1 * j ;■: t \ h i mi!;., T tv 1 
populations art 1 monitored by b pi aliquot sampling every 20 
min.. The [1PTC;1 t ind duration ul incubation with K ~i that 
result m t ho • - i i m i n.i t i on oi ti 1 j ii cfdls, with concomitant 
1 1 i i m i i j it- ] ( ,n ■ ■■ I : • ••.-;» •: t T . < > : J . - , » i - ■ opt i no i i p. 1 1 .une t va ] u<-s 

n a us*' in th«* • ' I :;tkm . 

Tl m > ('{linn ;ot kmi trials. ( io. ;or ibed here p r o v i do p. a r aino t e i - 
value;; f I I PTC, ] , duration oi ; ::ouba t i on with K'i , [mAbTV], 
duration oi i n- m it i 1 ) . on w:th mAbT / ) t Ji.it allow the -"I.STF.M to 
' b d ! < • v a : i 1 ! * s • . ? ! -I ! i 1 • . b i ?:d th»- t • j f '. j » o t w p | | ... 
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OmpA plasmid pEV218 (described above, shown_in figure 9, 
construction described in Freudl, 1989), bears a polylinker 
rich in restriction endonuclease sites at a genetic region 
corresponding to extracellular loopIV of the polypeptide. This 
plasmid is digested with SacI (enzyme and reaction buffer from 
New England Biolabs) and Xbal (enzyme and reaction buffer from 
MBI Fermentas), under conditions described by the suppliers, 
yielding a fragment of the restriction site polylinker and the 

■ i „•>-,, — ■; ^ linearized Dlasmid is isolated by 

standard gei electrophoresis. 10 6 distinct, single-stranded 
oligonucleotides are synthesized by conventional technology 

(synthesis of variable oligonucleotides is described above) to 

read as follows: 

S ' NNGGTCTAGA (VNN) 2 0CGAGCTCNN3 ' 

in which N represents ail four nucleotides, each with equal 
probability; and V represents C, G, or A, each with equal 
probability. (E. coli strain UH203 is supF, capable of 
translating the UAG stop-codon. Therefore, variable 
oligonucleotides inserted into the OmpA vector P EV218 must be 
free of the stop codons UAA and UGA, but not of UAG.) Using a 
standard protocol described in the literature (Brown, 1992), a 
5- primer and a proof reading polymerase (such as the Vent 
polymerase) are used to generate double-stranded 
oligonucleotides from the single-stranded templates. These 
double-stranded, highly variable oligonucleotides are then 
digested with SacI and Xbal, under conditions described by the 
suppliers (New England Biolabs and MBI Fermentas, 
respectively), yielding highly variable oligonucleotides with 
non-complementary sticky ends. The highly-variable 
oligonucleotides are then ligated (MBI ligation buffer, 
conditions described by supplier) into pEV218, linearized as 
described in the previous paragraph. Because the SacI and Xbal 
sticky ends are non-complementary, they ligate in only one 
oligonucleotide per rlasmid, and in only one orientation. 
Conventional electrophoretic size fractionation is used to 
isolate ligated plasmids containing an insert. The product of 
this procedure is a population of plasmids (collectively 
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designated pEV2 181) bearing, at the genetic site corresponding 
to loop IV of OmpA, a highly variable exogenous sequence free 
of UAA and UGA stop codons . 

The variable plasmid population, pEV218I, is used to 
transform E. cold strain UH203 (lac, supF, ompA, lecA, proAB, 
rpsL) (this strain, its propagation and its transformation, 
are described more completely above, and in Freudl, 1989) . The 

UH2CH population is grown in Lurid broth supplemented with 

ft f 
lOOug Ap/ml, yielding a population of 10 cells comprising 10 

clones, each one of which carries a distinct member of the 

pFV2 1 R 1 poj >u I a t i on . 

f ) CI STEM- selection for OmpA exhibi ting high-affini ty for 
mAbT7 - 

The UH203 (pEV218I) population of 10 8 cells plus 
approximately 100 UH203 (pKV218T7) cells is incubated at 37C 
for 15 miii. in 100ml LB medium supplemented with 100 pg Ap/ml 
and IPTG between 0.25) and 2.0 mM, as determined in the 
optimization trial described above. This, yields a population 
of 10 8 olcr.es, comprising ] 0 distinct cJonr^:;, each of which 
displays a dist inert, exogenous., ol i q. >popt i do sequence at loop 
TV of OmpA. One of these o ! ones displays the T / tag sequence. 
After the initial 15 mm. induction of pKV? 1 H I /pEV2 1 8T7 
• >:pi "S'i , i OnM :r.Ab?' ; -s a : ss t ■. . tin iusmi oi ceils.. The 
colls are incubated with the iaAb ioi (>0 min. at 37°C. 
Col iphaqe KA is. then introduced at Mol / (the procedure lor 
growth and hand! i nq . • f K "i is. ie. ,c : : ; m -d .ibev.-, and in K: eudl , 
]^M0 12e, Morena .:]., 1 ' - ^ 1 ' ) . I n ■ -uh. . r l ni , with K i 

p r -ds : [ i.< minimum amount id I i n:e required, as 

determined in the optimization tiia! described above. 
Ft > 1 lowing incubation with phage, t h ■ • fell r >o p; 3 1 a t 1 n n 1 s 
eentrituqed to separate cells, t r orr. phau" and ( hp pioduets ol 
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To amplify the numbers of selected clones/JThe selected 
cell population is grown to 10 6 cells at 37°C in 100ml NB . 
Expression of OmpA is then heavily induced by 2 . 5 mM IPTG for 
15 min . Following induction, 10 4 cells from the selected 
population are plated onto NB agar containing 0.-5* sodium 
deoxycholate. (Sodium deoxycholate prevents growth of cells 
that fail to insert OmpA in the outer membrane; such cells are 
"false positives", selected by the CISTEM simply because they 
display no OmpA.. and not because of their capacity to bind 

1. ,„ r tv of OmpA. The sodium deoxycholate supplement 
also prevents K3 infection, providing a further guarantee 
against continuing infection.) Plated cells are incubated at 
37°C to grow colonies, which are composed of ceils m which 
loop IV of OmpA contains an exogenous oligopeptide with high 
affinity for the target compound, mAbT7 . 

ii. EXAMPLE 2: AN E. COLI OMPA/ COLI PHAGE K3 /NEISSERIA 
MENINGITIDIS ANTIBODY CISTEM 

a) E . coli Outer membrane protein A and Neisserial Opa 
proteins 

Several recent findings show that one important OmpA 
function explaining such high levels of expression and 
evolutionary conservation of the polypeptide is adhesion to 
and invasion of host tissues. It has recently been 
demonstrated that E. coli OmpA contributes significantly to 
xnvasion of brain microvascular endothelial cells a critical 
step in pathogenesis of neonatal E . coli meningitis. 
Similarly, OmpA homology are directly involved in microbial 
adherence, invasion of host cells, and pathogenesis by 
Haemophilus influenzae, Neisseria meningitidis, Neisseria 
gonorrhoeae, and other, virulent microbes. 

The structural homology between OmpA and a wide range of 
antigenic adhesins makes OmpA well-suited lor imprinting for 
use in applications requiring mimicry of homologous adhesins. 
One such application is vaccination. When a polypeptide (m 
this case, OmpA) is imprinted to bind to antibodies composing 
the sera raised against an antigen, the imprinted polypeptide 
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may elicit an immune response very similar to the "one provoked 
by the antigen (Folgori et al . , 1994). Antibodies suitable for 
use as the target compound in a CISTEM for imprinting E . col i 
OmpA to mimic Neisseria meningi tidis Opa proteins (Olyhoek et 
al., 1991) include mAb B33, which reacts with all Opa 
proteins, mAbs P322 and P514, which react with a broad subset 
of Opa proteins, and mAb B306, mAb A222/5b (Olyhoek et al . , 
1991), and others described in the literature". 



b) CISTEM-selection for OmpA exhibiting high-affinity for 
antibodies against 

Neisseria meningitidis Opa proteins 

The construction of a cell population for testing and 
optimizing CISTEM selection, the optimization and 
demonstration of CISTEM function, and the construction of an 
OmpA-imprinting CISTEM cell population are performed exactly 
as described above: this C I STEM, like the one described in 
Example 1, uses phage K3 to select from a cell population 
displaying variable inserts 20 amino acids in length at loop 
IV of OmpA. However, whereas Example 1 imprints OmpA to bind 
to mAbT 7 , this CISTEM imprints OmpA to bind to antibodies 
against Neisseria meningitidis Opa proteins. Therefore, where 
the protocol described in Example ], pari f, -..ill:: for 10 nM 
mAbT7, this C I STEM calls for 10 nM of each of the tollowing 
antibodies: B33; P322; P514; B30 6; and A222/r>b. Aside from 
this substitution, the protocol for imprinting OmpA to bind 
ant. i-()pa anribodios is as described in Example l, part f. 

ill. EXAMPLE 3: MONITORING THE DYNAMICS OF CISTEM SELECTION 



To monitor the process by which cells that 
t a r a, ■ t r\ unr -a uj nd a m ■ * ■ _1 ] m i n a t ed , wh j ! < ■■ * h • 



fail to hi nd t he 
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amplacxllxn/ml and 2mM IPTG to induce expression of P EV218T7, 
the plasmid encoding OmpA with the T7 tag at extracellular 
loopiv of the mature polypeptide. Each overnight culture was 
diluted with LB to OD600 = 0.005. To create each of three mixed 
populations, aliquots of the two cultures were combined to 
compose a population in which the ratio of T cells to U cells 
was approximately 1 to 200. Sample aliquots were taken from the 
populations and plated on LB plates supplemented with lOOyg 
™n^il in/ml. and on LB plates supplemented with 25ug 
kanamycin/mi. After overnight incubation, colonies were counted, 
showing that the ratio of Tagged to Untagged cells was 
approximately IT to 190U (table 8, t= -60) . Two of the three 
mixed populations were incubated ror ou nun. *u 3 . „ ^t„— nux, 
whereas the third population was incubated under the same 
conditions with 68nM mAbT7 (described above). (In table 8, the 
presence or absence of mAbT7 is indicated by a + or a -, 
respectively.) After one hour of incubation (i.e., at time t-0 
in table Al ) , Phage K3 was introduced to two of the three mixed 
populations at a Multiplicity of Infection of 6. (In table 8, 
the presence of absence of phage K3 is indicated by a + or -, 
respectively.) Over the next 180 mm., sample aliquots were 
taken from the populations at the times indicated m table 8, 
namely, immediately preceding the addition of phage, and then 
every 15 minutes thereafter. Aliquots were used to estimate 
phage titer, and were plated on the appropriate antibiotics to 
determine numbers of U and T cells. 

Figure 17 shows the dynamics of phage titer and Mol over the 
course of CISTEM selection. Mol climbs exponentially and reaches 
a maximum near 100,000. Figure 18 shows the dynamics of cell 
growth and death during C I STEM selection. The rapid exponential 
growth of phage results in a monotomc decline of U cells, 
irrespective of the presence of absence of mAbTl . By contrast, 
in the presence of mAbT7, T cells are protected from the 
P ^ rw ly high Mol, and consequently they expand in number. 
Thus, specific interaction between the antibody and the epitope 
displayed at OmpA loopIV resulted in selection of cells 
exhibiting the epitope, while cells that did not exhibit the 
epitope were eliminated. This result is summarized in figure 
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iv: EXAMPLE 4: REDUCTION OF BACKGROUND THROUGH _THE USE OF 
HELPER CELLS 

The term "background" is used to refer to cells that fail 
to bind the target compound but nonetheless remain in the 
selected population. In general, it is desireable to minimize 
the number of background cells, such that CISTEM reliably 
eliminates cells whose imprinted domains fail" to bind the target 
compound, while selecting those cells whose imprinted domains do 
bind the target compound. The following experiment illustrates 
one strategy for reducing background. The- C 1 STEM cell population 
is supplemented with "helper cells", the function of which is 
propagation and amplification of the selective agent. This 
amplification of the selective agent, increases the intensity of 
selection, and thereby reduces residual background cells. The 
helper cells lack a selectable marker that is present in the 
rest of the CI. STEM cell population; this permits efficient 
elimination of the helper cells after they have served their 
purpose of amplifying the selective agent. 

UH203 (pEV218T7) , the "T" strain, was cultivated overnight 
in LB supplemented with lOOpg amp i ci 1 1 i n/ml and ImM IPTG to 
induce ex p r e s s i o n o f t h e o nip A p 1 a s m id. A r a b a ckqround strai n 
t h a t do- ; not e x h i b i * t h e T 1 e p i t o p o a t 1 o o p 1 V o f < jmp A , 
UHFl(pEV218) was used. UHF1 is virtually isogenic with UH203, 
rxcer * M M i it cairns a gor.f 1 I o : r *h 1 ramphoiii i r es x s t a nee . 
The pldsmid pEV21H as identical to pEV218T7, except that it 
carries a rest rict ion-enzyme site polylinker in place of the 
gon^ tor the T taa. UH Fl ( pEV? 1 H ) , the "H" stiam, was 
o\i ! t. i vat.od itvri ninht in I.M s uj 'p 1 eruen t ed with lou^g .impiri 1 ! in/ml 
and IrnM il'TO ti< induct 1 expi.'SSLon ol pHV2 1 h . To ei.-atc mixed 
pt >f >u i ci t i ons , filujuot of thr ,X T" and "U" s t rain.-, wer. e comb i nod 
to compose population. 4 ; with a rat m of T t o U cells ol 
appr ox mat ed y 1 to 1(H) (tablo ( n . As helper cells, MCA 100 
pKCj. -p;i] : wa. us.- i. 1 'ihjj I ! X -1 I - • : ( f >Iv . FrN 1 ; : f 

• •. v 4 i i • * ...... J ♦ " » V ' 
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(i.e., at time t=0 on table 9), phage K3 were added to all 

populations at an Mol = 2 . All populations were then incubated 

for two hours. Aliquots were taken and plated on agar 

supplemented with ampicillin alone, or with ampicillin plus 

chloramphenicol. The numbers of U and T cells in the population 

at the end of CI STEM selection are shown in table Bl . The 

background proportion, defined as U cells/ (U cells + T cells), 

is reduced from 34% in the absence of helper "cells to 16% with 

ion,,i r,f hPlppr cells at OD M o . This experiment 
rne duo.j-i-j.oii - — - — -- i 

... . . _ n-'— ; r]p hplppr cells: thev are added 

thus illustrates cne prin<-^pie ~~ •• C1 F C1 

to CISTEM cell population to modify the activity of the 
selective agent. 

V. EXAMPLE 5: THE SENSITIVITY OF CISTEM 

The following experiment was performed to demonstrate the 
sensitivity of CISTEM selection. The E. coli strains 
UH203(pEV218T7), here called «T" cells, and UHF1 (pEV2 1 8 ) , here 
called "U" cells, were cultured overnight with ImM IPTG to ^ 
induce expression of the ompA plasmids. To create mixed . 
populations, aliquots of the T culture were diluted 10, .100, 
1000, or 10,000 times, and then combined with undiluted aliquots 
of the u culture to compose populations with T to U ratios 'as 
shown in table 10. Helper cells were added as described in the 
previous example. One set of five mixed populations, 
representing each of the serial dilutions of T cells (see table 
10), was incubated for 1 hour at 37C with 340 nM mAbT 7 . Two 
replicate sets of five mixed populations (a total of 10 
populations, comprising two of each of the serial dilutions of T 
cells) were incubated under the same conditions without the 
antibodies. Immediately following incubation, phage K3 was added 
to the set of five mixed populations that had been incubated 
with the antibodies, as well as to one of the two sets of five 
mixed populations that were not incubated with the antibodies. 
In table 10, the presence of antibodies or phage is indicated by 
a +, whereas the absence of antibodies or phage is indicated by 
a -. After two hours of incubation, aliquots from each 
population were plated on agar containing the appropriate 
antibiotics, as described above. After overnight culture, 
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colonies were counted to determine the composition of the CISTEM 
eel 1 popul at ions . 

Table 10, shows not only the final numbers of cells, but 
also several indices of CISTEM' s performance in selecting T 
cells from a background of U cells. Enrichment, an index of the 
selection system's capacity for amplifying clones that bind the 
target compound while eliminating those that do not, is defined 
as (T cells/U cells),ir, r ^i- r ,r.„/ (T cells/U cells ) t,<a.>i<> SP i eai0( , . 
Yield, an index of the proportion of all cells that bind the 
target compound that are successfully selected by the CISTEM, is 
estimated by comparing the number of T cells in the K3- 
populations with the number ot T cells in the mAlM 
populations. Yield is defined as (T eel Is ) K^.-mAb* / (T cells) K v. 

vi.: EXAMPLE 6: MAPPING THE EPITOPE RECOGNIZED BY mAbT7 

One application of CISTEM is in mapping the epitopes involved 
in protein-protein interactions. To illustrate this application, 
CISTEM was used to select cells displaying the phage T7 
polymerase epitope that is recognized by the T7 -polymerase 
specific mAbT7 (described above). The plasmid pET17xb (Novagene) 
contains the entire gene for phage T7 polymerase. This plasmid 
w a s d i g e s t e d w l t h rricrococ c a 1 i i u c lease, a <:: c ordin g t t > \ • r o t o c o ] ; ; 
provided by the supplier, to yield fragments of a range of sizes 
from approximately sn basepairs to lOObp. The band ■ .« r i ^ : ■ poi v. 1 i :v j 
to fragments between 60 and 75 bp in length was excized with a 
razorblade and the fragments were resuspended. The fragments 
were t re a t ed with, i' 1 nuclease ( cond i t i on ; > , j ce< >r d inq to suopl i or ) 
t < ■ remove the > ' t A a >sf >ha t es 1 f-t I by ir.icrooM'fM] nii*'* least . The 
t r , i one » 1 1 4 s w« • r • • t ii-i: f d k -sj - hr i y 1 • i t * w 1 by T 1 i >i ; 1 y mio ! e ; ! i d 1 in a s t ■ . 
Tl: (i ompA plasmid pEVJ 1 H ( doscr l bed abuve) w<is cleaved ny the 
restriction enzymes Kc; 1 1 !S u I J and Sail (condition:; ot digest ion 
according to supplier)/ at unique sites in the* genetic region 
coi i e s p oiiduiM t" o t h < • fourth * ■ x t r <-i c ■ « ■ ) 1 \ : 1 - • * 1 < < > p < o t h * ■ rot i:: f 
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Circulation of the vector without insertion of a fragments 
creates a Sail site. Therefore, the products of ligation were 
treated with Sail to linearize vectors that did not contain a 
fragment The circularized vectors containing fragments were 

solTed'by electrophoresis, and used to transform the highly 
competent, commercially available strain XL-1 blue. Cells were 
P :iced on LB supplemented with lOOug/ml ampicrilrn to se act or 
transformants. Colonies were grown in 250ml LB supplemented wrth 
._ = „ qlucose for catabolic repression of low- 

MP ' C1 " " I", I^e omF » plasmid. Plasmid DNA was extracted 
level expression ox tne om*-.- t-j-o it 
using the Qragene kit. 7pg of plasmid DNA was cleaved by Sail 
and Led for transformation of UH203 (an ompA deficient strain 
described above) by electroporat ion and CaMg transformation. 
Cells were grown on LB supplemented with^OOpg ampicillin to 
select for transformants. Th,s population of transformants will 
be refered to as UH203 ( P EV2 1 8T7L) . 

The UH203( P EV218T7L) population was grown in overnight 
culture with ImM IPTG to induce expression of the ompA plasmid. 
Th cells were incubated for one hour with 680nM mAbT7, .flowing 
for binding between the monocolonal antibody and the epitope 
presented at OmpA loopIV by certain members of the cell - 
Ration. Allowing incubation, phage K3 were added at.n Mo 
! 2, and the cells were rncubated with phage for two hours. The 
cells were then plated, and the results of selection (shown in 
fig 21) are summarized in table 11. 25 colonies were sampled, 
their plasmrds were isolated as above, and the plasmids were 

ubj ected to cleavage by the restriction endonuc iease^hel The 
genetic sequence encoding the epitope recognized by mAbT7 (i.e 
the T7 tag? contarns a Nhel site; no other Nhel site appears in 
PEV218. Therefore, only plasmrds comprising the sequence 
encoding the eprtope of xnterest are vulnerable to NheT . 24 
the 25 clones sampled were vulnerable to Nhel. 
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TABLES . _ 

Table 1: Each polypeptide (column A) is listed with the 
selection agent (column B) that could be used in a CISTEM to 
imprint the polypeptide; the respective domains that compose 
the agent's adhesion domains (column C; numbering begins at 
the ammo terminal); a domain that is interacting (e.g. for 
stencai reasons) with the adhesion domain and could be 
imprinted to bind a target compound (column D) 



A: 

Polypep tide 


B: 

Select ion 
Agen t 


C : Adhes ion 
Domain 


O * P> >~ r^i f~\ o /~A 

Imprinted 

i^wma x i i 


OmpF 


Coll cm A 


extracellular 


extracellular 






loops 1-5 


loops 6 and 7 




Colicin N 


ex tracel lul ar 


extracellular 






loops 1, 3-6 


loops 2 and 7 


OmpA 


see figure 


see figure 3 


any of 4 




3 




loops 


I CAM- 1 


rhinovi rus 


Ig domain 1 


Ig domains 2 








and 3 


MHV 


mouse 


doma i n 1 


all other 


receptor : 


hepa t i t i s 




domains 


120 kl> 


vi rus 






q i ycopror e i n 
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Table 2 



Component 



Cations 




Na- 


5-15 




140 


Mg 2 * 


30 


Ca J * 


1-z 


H- 


4x10" or pH 7.4 


Anions 






4 



Intracellular Cone . 
(mM) 



Extracellular 
Cone. (mM) 



145 

5 

1-2 

2 5-5 

4x10' or pH 7. 
110 



Table 3: Serial dilutions of selection agent and target 
concentrations in preliminary optimization of CISTEM 
selection. The cell populations listed in the table are 
described in the text. 

Mol or [Selection Agent ]/ [Cells] 



X5 

a 

o 
CM 

o 
u 

0) 
M 



T5 


<v 


<D 






•<H 


c 




-r-i 




M 


<U 


a 


O, 




>i 


i — < 


H 




O 




a. 





0 


5 


10 


20 


0 




B, 


B 2 


B, 


10 : 




Ai 


A, 


A, 


10< 






A, 


A t 


10> 




C i f At 


C 2 / A* 


C 3/ A, 
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Table 4 



Time [min] 


Phage 


Antibody 


Selection 


Selected 


Number of colonies 


0 


- 


- 


LB 


both 


1085 






- 


Ap 


T 


598 




- 


- 


Km 


U 


402 






- 


Ap 


T 


204 






- 


Km 


U 


34 






+ 


Ap 


T 


440 






+ 


Km 


U 


21 


2 




- 


Ap 


T 


138 






- 


Km 


U 


30 




+ 


+ 


Ap 


T 


336 




+ 


4- 


Km 


U 


31- 


10 


+ 


- 


Ap 


T 


67 




+ 


- 


Km 


U 


3 - 




+ 


+ 


Ap 


T 


222 




+ 


+ 


Km 


U 


2 


30 


+ 


- 


Ap 


T 


23 




4- 


- 


Km 


U 


0 




+ 


+ 


Ap 


T 


139 




+ 


+ 


Km 


U 


0 


60 


+ 


■ 


Ap 


T 


3 




4- 




Km 


U 


0 




4 


4- 


A 

Ap 


T 


152 




i 


i 
i 


ivm 


r T 

u 


0 




4~ 




Ap 


r 
1 


3 




i 
i 






r t 

U 


0 




4- 


+ 


Ap 


r 


114 




4 


4 


Km 


u 


I) 


120 


4 




Ap 


r 


3 




4 




Km 


u 


3 




4 


4 


Ap 


r 


113 




4- 


4 


Km 


U 


3 


180 


■4 




Ap 


1 ( 


3 




+ 




Km 


II 


3 




4 


4 


A P 


I 


106 
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Table 5 



Presence or Absence 

of T7 Tag at OmpA Loop IV 



Tag + 
Tag- 



Presence or Absence of 
mAbT7 in Medium 



mAbT7 + 



mAbT7 



K3 Resistant 


K3 Susceptible 


K3 Susceptible 


K3 Susceptible | 



Time 
[min] 


Phage 


Antibody 


Selection 


Selected 


Number of colonies 


0 






LB 


both 


1188 








Ap 


T 


663 








Km 


U 


484 


+ 




Ap 


T 


12 


+ 




Km 


U 


0 


1 


+ 


+ 


Ap 


T 


18 


+ 


+ 


Km 


U 


0 


10 


+ 


+ 


Ap 


T 


24 


j + 


+ 


Km 


U 


1 


20 


+ 


+ 


Ap 


T 


43 


+ 


+ 


Km 


U 


0 


30 


+ 


+ 


Ap 


T 


75 


+ • 


+ 


Km 


U 


0 


60 


+ 


+ 


Ap 


T 


655 


+ 


+ 


Km 


U 


0 



Table 7 



Time [min]l 


Phage Antibody 


Selection 


Selected 


Number of cells 


0 






amp 


T7 tag-OmpA 


616 






kan 


wt OmpA 


75400 


120 






amp 


T7 tag-OmpA 


2867 






kan 


wt OmpA 


476000 






anip 


T7 tag-OmpA 


0 


+ 




kan 


wt OmpA 


0 




+ 


amp 


T7 tag-OmpA 


2845 


+ 




kan 


wt OmpA 


0 
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Table 8 



rTimr... , - 


K.X. 


mAh_ 






UUffihfU_-CLf 


Tirrr of K3 


MnT 


1-60 


- 




Ad 


T 


269 








Km 


u 


50176 






0 


- 


- 


Ap 


T 


67 7 


4 . 00E+05 


6 








Km 


u 


67872 


i 5 


- 




Au . 


T 


873 




- 




Km 


u 


7347? 


: 






■+ 




Ap 


T 


269 


5. 91E+05 






Km 


u 


54b56 






+ 


Ap_ - 


T 


1120 


11 




Km 


u ! 


5?«64 


30 


- 


- 


Ap 


T 


1832 






Km 


u 


9 52 00 




- I 






Ap 


T 


22 4 


6.00E+05 


— 


Km 


U ' 


5 07 3 6 




+ 


4 


Ap 


T 


1 34 4 




I 


Km 


u 


48832 


45 


- 




Ap 


X. 


38 5 3 


- 




Km ! 


u 


134848 




I 






Ap 


T 


? 6 9 


1 . 00E+06 


— 


Km 


U 


4 7712 




+ 




Ap 


T 


1 3 4 4 


21 


Km 


u 


4 72 64 


CO 






Ap , 


T 


2061 






Krn 


u 


1 4 uOO'» 




_ 






Ap 


T 


3 1 3 


1 . OOK+06 




Krn 


u 






4 




Ap 


T 


j 1 3 




n r 


Krn 


u 






4 




Ap _ 


T 


28 6 / 






Krn 










Ap 




2 2 4 


8 . 00 Ef 0 5 




Km 


U 


< 4 3 H 4 




4 






T 


41.11 


2 0 


Kir 




",H4f 


1^- .,- — + 
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Table 9 



Time 


Helper 


K3 


mAb 


Cell 


Number of 


Background 


[min ] 


Cells 
[pl] 






type 


eel Is 


[%] 


-60 


_ 


— 


- 


T 


21863 












U 


2446080 




0 


- 


- 


- 


T 


26476 












U 


5249760 




120 


_ 


- 


- 


T 


158112 












U 


52240400 






0 


+ 


— 


T 


1562 












TJ 


74 66 








+ 




7 


14600 


33 . 88% 










U 


7480 






100 






T 


256 












U 


4405 








+ 




T 


6830 


15.80% 










U 


1282 
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Table 
10 

Time 
[min] 1 



-60 



K3 



mAb 



Cell 
Type 



120 



U 



T 



Dilution 
of T cell 
culture 



Number of 
Cells 



U 



uncii 1 uted 



10 



100 



1 000 



10000 



1 . 46E+08 



T:U Ratio 



1 Qtr_n^ 



21 500 



1340 



106 



11 



undi luted 



153200 



10 



8736 



100 



985 



1000 



146 



10000 



undi luted 



10 



100 



1000 



10000 



undi luted 



10 



1 0 0 



1000 



10000 



undi luted 



1 . 86E+08 



?E — 04 



9 . 18E-06 



7 . 26E-07 



7 . 53E-08 



15 



5. 37E+08 



256200 



12400 



1500 



270 



18 



446000 



2318 



120 



12 



396500 



264500 



100 



1000 



1 0000 



14200 



1 500 



195 



Enrichment 



5 . 63E + 07 



Yield 



? . 4 3E+02 



4 . 12E+02 



6.7 7E+02 



1 . OOE+02 



100% 



100% 



100% 



72% 



17% 



Background 



60 . 0% 



96. 5 C ^ 



99 . 6% 



100.0% 



100 . 0% 
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Table 11 



iTirae [nnii] 


Bacteriophage K3 


anti-T7 tag raAb 


Number of cells \ 








76000 j 


1120 






2043000 | 


+ 




3513 J 


+ 


+ 


5020 _J 
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CLAIMS 

1.: A method for identifying and isolating at: least one of a 
polypeptide of interest, a target compound or a gene of said 
polypeptide of interest or a gene of said target compound, 
wherein at least one of sard polypeptide of interest and said 
target compound is unknown comprising the steps 

a) constructing a cell population by engineering it to 
express a membrane pol ypept i de which has said 
polypeptide of interest in its extra- or intracellular 
domain, said extra- or intracellular domain being 
exhibited m one or more copies on each cell's surface 
directly or indirectly interacting with a selection site 
specific for a selection agent, such that upon binding 
of said target compound to said polypeptide of interest 
said selection site of cells which exhibit such a 
binding of said target compound to said polypeptide of 
interest is modified in a manner that said selection 

a g en t m a y b e used for selecting t h o s e ceils whic h 
exhibit said binding of said target compound to said 
polypeptide of interest from oells which do not exhibit 
sa i d b i nd i ng , 

b) exposing said ceJ 1 population to said target compound or 

a population o i said t.aiuet co >mpouru;s lor n pexicd o! 

time sufficient for said target compound to bind to said 

polypeptide of interest , 
■ * ) exposing said co ] 1 popubo i or: s i mu ] t aneous- 1 y with or 

- 1 t t • • r ■ t or ) b ) t ' i s t i i d : < * 1 * o • t i ; : •, ( • < u n t f r . r , > po r i od of 

: l m» s u I i J i 'Mil lew ; - i l d d i j t - i 1 1 to nit eirif! with a l < i 

selection : l t e , 
d) selecting i or clones which shew said binding of said 

target compound to said polypeptide of interest , 
' ■ ! opt o n < t i i v - i ; i o w ; ! i ■ j • ■ . < : i d • • i • ' • • o < o ■ a i ♦ • ; : t . . mu it i p 1 '/ arid 
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2.: The method according to claim 1, wherein said interaction 
between said extra- or intracellular domain with said 
polypeptide of interest and said selection site is a sterical 
interaction, especially a sterical interaction which does not 
allow said selection agent to interact with said select ion site 
after binding of said target compound to said polypeptide of 
interest . 

*> . . „^ + ,A nrf 1-r> ^l^lTTl 1 O Y~ 9. Vjhprpin ^ 3 1 d 

_>.. 1UC H I O .* * ^^^w^^^-.-j - , 

selection site is immediately adjacent to the said extra- or 
intracellular domain with said polypeptide of interest. 

4. : The method according to any one of claims 1 to 3, wherein 
said selection agent is a toxic agent. ~ 

5. : The method according to any one of claims 1 to 4, wherein 
said selection site is an adhesion domain for said selection 
agent . 

6. : The method according to any one of claims 1 to 5, wherein 
said membrane polypeptide is selected from gl ycol lpid-anchored 
polypeptides, cx-helical polytopic receptors, bitopic integral 
membrane polypeptides, polytopic integral membrane polypeptides 
or membrane-bound isoforms of a secreted polypeptide. 

7. : The method according to any one of claims 1 to 6, wherein 
said polypeptide of interest is identified and isolated. 

8. : The method according to any one of claims 1 to 7, wherein 
said extra- or i n r : i: ace 1 1 u 1 a r domain of said membrane polypeptide 
contains a highly variable region which is, upon binding to said 
target compound, identified and isolated as said polypeptide of 
interest or a section thereof. 

9. : The method according to claim B, wherein said highly 
variable domain comprises polypeptides encoded by f j cDNA library 
or sections thereof . 



BNSDOCID -WO 9R30151A1 I - 



WO 99/30151 



PCT/EP98/07922 



- 117- 

10. : The method according to any one of claims I to 9, wherein 
said membrane polypeptide is heterologous for the cells. 

11. : The method according to any one ol claims 1 to 10, wherein 
said membrane polypeptide is genetically engineered to contain 
said ex t r a - o r int r a eel 1 u 1 a i d om a in a n d said selection sit e 
residing adjacent to each other on the cellular surface. 

12. : The method according t c j any one of claims 1 to 11, wherein 
said target compound is composed of a mixture of compounds. 

13. : The method according to any one of claims 1 to 12, 
characterised in that it is carried out. in the form of a high- 
throughput screening assay wherein said -cell population is 
exposed to a multiplicity of target compounds from which an 
unknown binding partner for a polypeptide of interest is to be 
identi f led and isolated . 

14. : The method according to any one of claims 1 to 13, wherein 
said target compound is secreted by the cells expressing said 
membrane polypept ido. 

15. : The* method aoesudmg to any one ;d claims I to I 1 , wherein 
said selection agent is a pathogenic agent. 

16. : The method according to any one ot claims 1 to lb, wherein 
said selection agent is a virus, preferably a bacteriophages 

17. : 7)\(> rr.'dhod j. -<\ ■■ r d i I ;u If. any on- ..: da] ms 1 { o ](., wherein 
s.ii«i pat h':'J-n ic i ■ J t ■ f 1 1 a mod i : i . d s . i t s ■: .< p > r j j , ■ (1 , M nt with a 

d: Iter out et i ee: un ti^- cells than th»- cdi i espoiidinq naturally 
occurring pathogenic agent , pi of ei ably n stronurr of feet than 
t h « 1 natural! y <a -cur i i n ■. i pat tiojen i e agent . 
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19. : The method according to claim 1 8 , wherein steps b) through 
e) are repeated until said cell population comprises a desired 
number of distinct clones. 

20. : The method according to claim 18 or 19, wherein new 
sequence variations in said variable regions of clones selected 
in step d) are introduced before steps b) to e) are repeated. 

2i - t h e mpfhod accordina to anv one of claims 1 to 20, wherein 
said membrane protein exhibits an additional domain on the 
surface of the cell, said additional domain being adjacent to 
both, the variable domain and said selection site and said 
additional domain also interacts, preferably by binding, with 
said target compound. 

22. : The method according to any one of claims 1 to 21, wherein 
said polypeptide of interest bind to said target compound and 
affect its function, comprising screening polypeptides exhibited 
by said clones selected in step d) and amplified in step e)^by 
their ability to affect the function of said target compound. 

23. : The method according to any one of claims 1 to 22, wherein 
said polypeptide of interest exhibits an enzymatic activity, 
comprising a step wherein said cell population is placed in a 
nutrient medium in which each cell's supply of a vital compound 
depends upon that cell's variable region acting enzymat ically 
upon a compound structurally similar, but not identical to said 
target compound. 

24. : The method according to any one of claims 1 to 23, 
wherein said membrane polypeptide is an a-helical polytopic 
receptor, a glycol ipid-anchored piotein, a glycoprotein or a 
prot eog 1 yean . 

25. : Cell population for use in a method according to any one of 
claims 1 to 24 comprising a library of cells, each cell 
exhibiting 

a) a membra no polypeptide which hns said polypeptide of interest 
in its extra- or l n t. r ace I 1 u 1 a i domain and 
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b) a selection site being specific for a selection agent, 
such that upon binding of a target compound to the polypeptide 
of interest said selection site of cells which exhibit such a 
binding of said target compound to said polypeptide of interest 
is modified in a manner that said selection agent may be used 
for selecting those cells which exhibit said binding of said 
target compound to said pol ypept ide of interest from cells which 
do not exhibit said binding. 

26. : Kit comprising 

a) a cell population according to claim 25, 

b) a target compound and/or a library of target compounds and 

c) one or more selection agent (s). 

27. : A method for producing a pharmaceutical preparation, 
comprise ng 

identifying and isolating a novel pharmaceutically active 
compound according to a method according to any one of 
claims 1 to 24, 

adding pha rmaceu t i ca 1 1 y acceptable auxiliary components and 

completing said pharmaceutical preparation 
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Figure 2 
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Figure 3 
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Figure 4 

Dl - D5 are the glycosylated protein's five homologous 
immunoglobulin-like domains. 
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Figure 9 
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Figure 16 
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Figure 17 
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